CHAPTER 1


INTRODUCTION 

1.1.  An  Overview. 

The  commonly  used  multivariate  techniques  of  principal  components, 
factor  analysis,  canonical  analysis,  multivariate  analysis  of  variance, 
and  discriminant  analysis  are  based  on  latent  roots  and  latent  vectors  of 
random  matrices.  Unfortunately,  even  under  the  usual  assumption  that  the 
observations  have  been  drawn  from  a multivariate  normal  population,  the 
exact  sampling  distributions  of  the  sample  latent  roots  and  sample  latent 
vectors  are  generally  so  complicated  that  they  are  practically  useless  for 
computational  or  inferential  purposes.  The  probability  density  functions, 
and  hence  the  likelihood  functions,  are  very  difficult  to  evaluate 
numerically  and  are  analytically  intractable.  Also  it  is  not  at  all  clear 
how  one  should  draw  inferences  about  a subset  of  the  population  parameters, 
the  remaining  population  parameters  being  nuisance  parameters.  Asymptotic 
approximations  to  the  exact  sampling  distributions  appear  to  be  necessary 
and have produced some useful results (see for example the work of
Anderson  (1965)  and  James  (1969)  in  principal  components). 

This  research  considers  sample  latent  roots,  calculated  from  a sample 
from  a multivariate  normal  population,  which  arise  in 

(i)  canonical  correlation  analysis,  and  in 

(ii) multivariate analysis of variance (MANOVA) and discriminant
analysis.

The exact probability density functions (pdf's) of the sample roots in
both of these cases involve hypergeometric functions of matrix arguments.
Constantine (1963) found a representation of these hypergeometric functions
as a series of zonal polynomials, but in general the zonal polynomials are
very difficult to calculate and the series converge very slowly. In this
research, asymptotic expansions of the hypergeometric functions are derived
and these expansions are then used to obtain expansions of the pdf's, and
hence of the likelihood functions. The resulting expansions are used to
study inference problems. In canonical correlation analysis, estimates
of the population coefficients are obtained and the Bartlett-Lawley tests
that the residual population roots are zero are examined. In MANOVA and
discriminant analysis, Bartlett's test that the residual population roots
are zero is examined.

The rest of this chapter contains a review of some basic facts about
zonal polynomials and hypergeometric functions of matrix arguments, a
review of the main distribution and testing results in canonical
correlations and MANOVA and discriminant analysis, and a brief summary of
the remaining chapters and notation.


1.2.  Zonal  Polynomials  and  Hypergeometric  Functions  of  Matrix  Arguments. 

Let $S$ and $T$ be $m \times m$ complex symmetric matrices and let
$a_1, \ldots, a_p, b_1, \ldots, b_q$ be complex constants. The hypergeometric
functions ${}_pF_q(a_1,\ldots,a_p;b_1,\ldots,b_q;S)$ and
${}_pF_q(a_1,\ldots,a_p;b_1,\ldots,b_q;S,T)$ are complex valued symmetric
functions of the latent roots of $S$ and of $S$ and $T$ respectively. When
the values of the $a_i$'s and $b_j$'s are clear from the context, we will
denote ${}_pF_q(a_1,\ldots,a_p;b_1,\ldots,b_q;S)$ by ${}_pF_q(S)$ and
${}_pF_q(a_1,\ldots,a_p;b_1,\ldots,b_q;S,T)$ by ${}_pF_q(S,T)$. Also, when
it is important to emphasize the dimension of $S$ and $T$ we will write
${}_pF_q^{(m)}(a_1,\ldots,a_p;b_1,\ldots,b_q;S)$ (or ${}_pF_q^{(m)}(S)$) and
${}_pF_q^{(m)}(a_1,\ldots,a_p;b_1,\ldots,b_q;S,T)$ (or ${}_pF_q^{(m)}(S,T)$).


Herz (1955) developed the general system of hypergeometric functions
of one matrix argument by means of multidimensional Laplace and inverse
Laplace transforms. Constantine (1963) found a representation of
hypergeometric functions as a series involving zonal polynomials. We take
Constantine's series representation as our definition.


Definition --

(1.2.1)  ${}_pF_q(a_1,\ldots,a_p;b_1,\ldots,b_q;S) = \sum_{k=0}^{\infty} \sum_{\kappa} \frac{(a_1)_\kappa \cdots (a_p)_\kappa}{(b_1)_\kappa \cdots (b_q)_\kappa} \frac{C_\kappa(S)}{k!}$

where

$(a)_\kappa = \prod_{i=1}^{m} \left(a - \tfrac{1}{2}(i-1)\right)_{k_i}$

and

$(a)_k = a(a+1)\cdots(a+k-1)$.

$C_\kappa(S)$ denotes the zonal polynomial of $S$ corresponding to the
partition $\kappa = (k_1,\ldots,k_m)$, $k_1 \ge k_2 \ge \cdots \ge k_m \ge 0$ and
$k_1 + k_2 + \cdots + k_m = k$, of $k$ into not more than $m$ parts. $C_\kappa(S)$
is a symmetric homogeneous polynomial of degree $k$ in the latent roots
of  S . A detailed  discussion  of  zonal  polynomials  is  presented  in  James 
(1961,1964)  and  Constantine  (196 

If $p > q + 1$, the series (1.2.1) may only converge for $S = 0$.

If $p = q + 1$, the series converges absolutely for $\|S\| < 1$ where $\|S\|$
denotes the maximum of the absolute values of the latent roots of $S$.

If $p \le q$, the series converges for all $S$. The constants
$a_i$, $i = 1,\ldots,p$, and $b_j$, $j = 1,\ldots,q$, are arbitrary except that the
$b_j$'s must not be integers or half integers $\le \tfrac{1}{2}(m-1)$.
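The ingredients of the series (1.2.1) other than the zonal polynomials themselves are easy to compute. The following is a minimal sketch, assuming exact rational arithmetic is acceptable; the function names `partitions`, `pochhammer`, and `gen_pochhammer` are illustrative and do not come from the text. It enumerates the partitions of $k$ into at most $m$ parts and evaluates the generalized coefficient $(a)_\kappa$.

```python
from fractions import Fraction


def partitions(k, m):
    """Yield the partitions of k into at most m parts.

    Each partition is a non-increasing tuple of length m, padded with zeros.
    """
    def helper(remaining, max_part, parts_left):
        if parts_left == 0:
            if remaining == 0:
                yield ()
            return
        for first in range(min(remaining, max_part), -1, -1):
            for rest in helper(remaining - first, first, parts_left - 1):
                yield (first,) + rest

    yield from helper(k, k, m)


def pochhammer(a, k):
    """Classical rising factorial (a)_k = a (a+1) ... (a+k-1)."""
    result = Fraction(1)
    for j in range(k):
        result *= a + j
    return result


def gen_pochhammer(a, kappa):
    """Generalized coefficient (a)_kappa = prod_i (a - (i-1)/2)_{k_i}."""
    result = Fraction(1)
    for i, k_i in enumerate(kappa, start=1):
        result *= pochhammer(a - Fraction(i - 1, 2), k_i)
    return result
```

For example, `gen_pochhammer(3, (2, 1))` is $(3)_2 (5/2)_1 = 30$, while `gen_pochhammer(Fraction(1, 2), (1, 1))` is zero. The second case illustrates why a denominator parameter $b_j$ equal to a half integer not exceeding $\tfrac{1}{2}(m-1)$ must be excluded: a term of the series would otherwise be divided by zero.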

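James (1964) extended (1.2.1) to define hypergeometric functions of
two matrix arguments by:

Definition --

(1.2.2)  ${}_pF_q^{(m)}(a_1,\ldots,a_p;b_1,\ldots,b_q;S,T) = \sum_{k=0}^{\infty} \sum_{\kappa} \frac{(a_1)_\kappa \cdots (a_p)_\kappa}{(b_1)_\kappa \cdots (b_q)_\kappa} \frac{C_\kappa(S)\,C_\kappa(T)}{C_\kappa(I_m)\,k!}$

where $I_m$ is the $m \times m$ identity matrix.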
m 

Zonal polynomials and hypergeometric functions of products $ST$ of
symmetric matrices are defined as the corresponding zonal polynomials or
hypergeometric functions of the symmetric matrix $S^{1/2} T S^{1/2}$
or $T^{1/2} S T^{1/2}$. Note that $ST$, $TS$, $T^{1/2} S T^{1/2}$, and
$S^{1/2} T S^{1/2}$ have the same latent roots.

James (1961) proved that

${}_0F_0(S) = \sum_{k=0}^{\infty} \sum_{\kappa} \frac{C_\kappa(S)}{k!} = \mathrm{etr}(S)$

where

$\mathrm{etr}(S) \equiv \exp(\mathrm{trace}\, S)$.
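The identity for ${}_0F_0$ rests on the normalization $\sum_\kappa C_\kappa(S) = (\mathrm{trace}\, S)^k$, the sum being over partitions of $k$. A small numerical sketch of the degree 2 case follows. The closed forms used for $C_{(2)}$ and $C_{(1,1)}$ are the standard ones in James's normalization and are quoted here as an assumption, since the text does not list them; the helper names are illustrative.

```python
def trace(S):
    """Trace of a square matrix given as a list of rows."""
    return sum(S[i][i] for i in range(len(S)))


def matmul(A, B):
    """Product of two square matrices given as lists of rows."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def zonal_degree2(S):
    """Return (C_(2)(S), C_(1,1)(S)) for a symmetric matrix S.

    Assumed closed forms:
      C_(2)(S)   = [(tr S)^2 + 2 tr(S^2)] / 3
      C_(1,1)(S) = 2 [(tr S)^2 - tr(S^2)] / 3
    """
    t1 = trace(S)
    t2 = trace(matmul(S, S))
    return (t1 ** 2 + 2 * t2) / 3, 2 * (t1 ** 2 - t2) / 3
```

For `S = [[2, 1], [1, 3]]` the two values are 55/3 and 20/3, which add up to $(\mathrm{trace}\, S)^2 = 25$. Summing $(\mathrm{trace}\, S)^k / k!$ over $k$ then gives $\exp(\mathrm{trace}\, S)$, which is the stated result.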

Herz (1955) derived the general system of hypergeometric functions
from ${}_0F_0(S)$ by means of multidimensional Laplace and inverse Laplace
transforms. Let $f(S)$ be a function of the $m \times m$ positive definite
symmetric matrix $S$. The Laplace transform $g(Z)$ of $f(S)$ is defined
as

$g(Z) = \int_{S>0} \mathrm{etr}(-SZ)\, f(S)\, (dS)$

where $Z = X + iY$ is an $m \times m$ complex symmetric matrix, $X$ and $Y$ are
real symmetric matrices, and the integral is assumed to converge in the
half plane $\mathrm{Re}(Z) = X > X_0$ for some positive definite $X_0$. The notation
$X > X_0$ means that $X - X_0$ is positive definite. The inverse Laplace
transform is defined by

$f(S) = \frac{2^{\frac{1}{2}m(m-1)}}{(2\pi i)^{\frac{1}{2}m(m+1)}} \int_{\mathrm{Re}(Z) = X > X_0} \mathrm{etr}(SZ)\, g(Z)\, (dZ)$.

The integral is taken over $Z = X + iY$ with $X$ fixed, $X > X_0$, and
$Y$ varying over all real symmetric matrices. See Herz (1955) for details.
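Herz proved the following:

(1.2.3)  ${}_{p+1}F_q(a_1,\ldots,a_p,a;b_1,\ldots,b_q;S) = \frac{1}{\Gamma_m(a)} \int_{T>0} \mathrm{etr}(-T)\, (\det T)^{a-\frac{1}{2}(m+1)}\, {}_pF_q(a_1,\ldots,a_p;b_1,\ldots,b_q;ST)\, (dT)$

where $\mathrm{Re}(a) > \tfrac{1}{2}(m-1)$, where

(1.2.4)  $\Gamma_m(a) = \pi^{\frac{1}{4}m(m-1)} \prod_{i=1}^{m} \Gamma\left(a - \tfrac{1}{2}(i-1)\right)$

is the multivariate gamma function; and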
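The multivariate gamma function is straightforward to evaluate on the log scale. The sketch below assumes the standard definition $\Gamma_m(a) = \pi^{m(m-1)/4} \prod_{i=1}^{m} \Gamma(a - \tfrac{1}{2}(i-1))$ for real $a > \tfrac{1}{2}(m-1)$; the function name `log_multigamma` is illustrative.

```python
import math


def log_multigamma(a, m):
    """Natural log of the multivariate gamma function Gamma_m(a).

    Assumes a is real with a > (m - 1) / 2, so every factor is finite
    and positive.
    """
    if a <= 0.5 * (m - 1):
        raise ValueError("requires a > (m - 1) / 2")
    log_pi_term = 0.25 * m * (m - 1) * math.log(math.pi)
    log_gammas = sum(math.lgamma(a - 0.5 * (i - 1)) for i in range(1, m + 1))
    return log_pi_term + log_gammas
```

With $m = 1$ this reduces to the ordinary gamma function, and $\Gamma_2(3/2) = \pi^{1/2}\, \Gamma(3/2)\, \Gamma(1) = \pi/2$ gives a quick check. Working with logarithms avoids overflow when $a$ is of the order of the sample size, as it is in the normalizing constants of the densities discussed later in this chapter.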
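${}_pF_{q+1}(a_1,\ldots,a_p;b_1,\ldots,b_q,b;S) = \frac{2^{\frac{1}{2}m(m-1)}\, \Gamma_m(b)}{(2\pi i)^{\frac{1}{2}m(m+1)}} \int_{\mathrm{Re}(T) = X_0 > 0} \mathrm{etr}(T)\, (\det T)^{-b}\, {}_pF_q(a_1,\ldots,a_p;b_1,\ldots,b_q;ST^{-1})\, (dT)$.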

We will need the following results which were proved by James (1964).

1. If $S = \mathrm{diag}(S_1, 0)$ where $S$ is $m \times m$, $S_1$ is a $k \times k$ complex
symmetric matrix, and $0$ is the $(m-k) \times (m-k)$ zero matrix, then

(1.2.5)  ${}_pF_q^{(m)}(a_1,\ldots,a_p;b_1,\ldots,b_q;S) = {}_pF_q^{(k)}(a_1,\ldots,a_p;b_1,\ldots,b_q;S_1)$.

2. Let $O(m)$ denote the group of orthogonal $m \times m$ matrices and let
$(dH)$ be the invariant measure on $O(m)$ normalized so that

$\int_{O(m)} (dH) = 1$.

Then

(1.2.6)  ${}_pF_q^{(m)}(a_1,\ldots,a_p;b_1,\ldots,b_q;S,T) = \int_{O(m)} {}_pF_q(a_1,\ldots,a_p;b_1,\ldots,b_q;SH'TH)\, (dH)$.
3. Bessel's integral --

Let $X$ be an $\ell \times m$ matrix, $\ell \le m$, and let $H = [H_1 : H_2] \in O(m)$
where $H_1$ is $m \times \ell$ and $H_2$ is $m \times (m-\ell)$. Then

(1.2.7)  $\int_{O(m)} \mathrm{etr}(XH_1)\, (dH) = {}_0F_1\left(\tfrac{1}{2}m; \tfrac{1}{4}XX'\right)$.


A detailed discussion of hypergeometric functions of matrix argument
can be found in Herz (1955), Constantine (1963), and James (1964).


1.3.  Canonical  Correlation  Coefficients. 

Canonical correlations were introduced by Hotelling (1936) as a means
of measuring the relation between two sets of variates. Let $x$ be a
$p \times 1$ random vector with mean $\mu$ and covariance matrix $\Sigma_{11}$. Let $y$
be a $q \times 1$ random vector ($q \ge p$) with mean $\nu$ and covariance matrix
$\Sigma_{22}$. Let $\Sigma_{12}$ ($p \times q$) be the matrix of covariances between $x$ and $y$.
Hotelling proved that there exist nonsingular matrices $A$ ($p \times p$) and
$B$ ($q \times q$) such that if $u = Ax$ and $v = By$ then the covariance matrix of
the random $(p+q)$-vector $(u', v')'$ is

$\begin{pmatrix} I_p & \tilde{P} \\ \tilde{P}' & I_q \end{pmatrix}, \qquad \tilde{P} = [P : 0] \ (p \times q),$

where $P = \mathrm{diag}(\rho_1,\ldots,\rho_p)$ is a diagonal matrix with
$1 \ge \rho_1 \ge \rho_2 \ge \cdots \ge \rho_p \ge 0$. The $\rho_i$'s are the canonical correlation
coefficients. It is easily shown that the canonical correlation
coefficients satisfy the determinantal equation

$\det(\Sigma_{12}\Sigma_{22}^{-1}\Sigma_{21} - \rho^2\Sigma_{11}) = 0$.
Let $X = [x_1,\ldots,x_N]$ and $Y = [y_1,\ldots,y_N]$, where $x_1,\ldots,x_N$ and
$y_1,\ldots,y_N$ are random samples of size $N = n + 1$ from $x$ and $y$
respectively. The matrix $S$ of corrected sums of squares and sums of products
(abbreviated as corrected s.s. and s.p. matrix) of $[X' : Y']'$ is defined
by

$S = \begin{pmatrix} X \\ Y \end{pmatrix} (I_N - N^{-1}E_{NN}) \begin{pmatrix} X \\ Y \end{pmatrix}'$

where $E_{ab}$ denotes an $a \times b$ matrix of ones. Partition $S$ as follows:

$S = \begin{pmatrix} S_{11} & S_{12} \\ S_{21} & S_{22} \end{pmatrix}, \qquad S_{21} = S_{12}',$

with row and column blocks of sizes $p$ and $q$, where $S_{11}$ is the corrected
s.s. and s.p. matrix of $X$, $S_{22}$ is the corrected s.s. and s.p. matrix of
$Y$, and $S_{12}$ is the corrected s.p. matrix of $X$ and $Y$.

Assume that $x$ and $y$ have a joint multivariate normal distribution.
Then the maximum likelihood estimates $1 \ge r_1^2 \ge r_2^2 \ge \cdots \ge r_p^2 \ge 0$ of
the $\rho_i^2$'s are the roots of the determinantal equation

$\det(S_{12}S_{22}^{-1}S_{21} - r^2 S_{11}) = 0$.

Note that if the $\rho_i$'s are strictly less than 1 then the $r_i^2$ are distinct
and between 0 and 1 with probability 1.

In  the  following  discussion  x and  y are  assumed  to  have  a joint 
normal  distribution. 

Fisher (1939), Roy (1939), and Hsu (1939) found the joint null
distribution, corresponding to $P = 0$, of $r_1^2,\ldots,r_p^2$. Hsu (1941b) found
the limiting joint pdf as $n \to \infty$ of $z_1,\ldots,z_p$ where

$z_i = \frac{n^{1/2}(r_i^2 - \rho_i^2)}{2\rho_i(1-\rho_i^2)}$  if  $\rho_i > 0$.
Asymptotically the $z_i$'s corresponding to different $\rho_i$'s are independent.
The $z_i$'s corresponding to equal nonzero $\rho_i$'s have a joint "normal type"
limiting distribution while the $z_i$'s corresponding to $\rho_i = 0$ have a
joint "chi-square type" limiting distribution. Constantine (1963) derived
the exact joint density of $r_1^2,\ldots,r_p^2$ when $n \ge p + q$. Let
$P = \mathrm{diag}(\rho_1,\ldots,\rho_p)$ and $R = \mathrm{diag}(r_1,\ldots,r_p)$; then $f(R^2)$, the joint
pdf of $r_1^2,\ldots,r_p^2$, is

(1.3.1)  $f(R^2) = \prod_{i=1}^{p} (1-\rho_i^2)^{\frac{1}{2}n}\ {}_2F_1^{(p)}\left(\tfrac{1}{2}n, \tfrac{1}{2}n; \tfrac{1}{2}q; P^2, R^2\right) \times C \prod_{i=1}^{p} \left\{ (r_i^2)^{\frac{1}{2}(q-p-1)} (1-r_i^2)^{\frac{1}{2}(n-q-p-1)} \right\} \prod_{i<j} (r_i^2 - r_j^2) \prod_{i=1}^{p} dr_i^2,$

$1 \ge r_1^2 \ge r_2^2 \ge \cdots \ge r_p^2 \ge 0,$

where

$C = \frac{\pi^{\frac{1}{2}p^2}\, \Gamma_p(\tfrac{1}{2}n)}{\Gamma_p(\tfrac{1}{2}(n-q))\, \Gamma_p(\tfrac{1}{2}q)\, \Gamma_p(\tfrac{1}{2}p)}$

and $\Gamma_p$ is the multivariate gamma function defined by (1.2.4).


Chattopadhyay and Pillai (1973) and Chattopadhyay, Pillai, and Li (1976)
found an asymptotic expansion as $n \to \infty$ of the ${}_2F_1$ hypergeometric
function in (1.3.1). Unfortunately their result involves a ${}_2F_1$ function
with the matrix $P^2R^2$ as argument. Sugiura (1976), using a perturbation
argument, calculated terms up to and including terms of order $n^{-1}$ in the
joint pdf of $z_1,\ldots,z_p$. If the terms of order $n^{-1}$ are ignored, then the
$z_i$'s corresponding to distinct $\rho_i$'s are independent.
Typically in canonical correlation analysis one is initially interested
in testing the null hypothesis $H_0$ that all the population canonical
correlation coefficients are zero versus the alternative that they are not
all zero. $H_0$ is equivalent to the null hypothesis that $x$ and $y$ are
independent. The likelihood ratio statistic $L_0$ for testing $H_0$ is
defined by

$L_0^{2/N} = \prod_{i=1}^{p} (1 - r_i^2)$.

Bartlett (1938) showed that $-\{n - \tfrac{1}{2}(p+q+1)\} \ln \prod_{i=1}^{p}(1-r_i^2)$ (where
$n = N - 1$) is approximately distributed as $\chi^2$ with $pq$ degrees of freedom
(df). The factor $n - \tfrac{1}{2}(p+q+1)$ was chosen so that the resulting statistic
has the same moments up to order $N^{-2}$ as $\chi^2$ on $pq$ df. An asymptotic
expansion for the distribution function of Bartlett's statistic has been given
by Box (1949) and Anderson (1958, Chapter 9).

If $H_0$ is rejected then $x$ and $y$ are correlated and often one
would be interested in summarizing the relationship between $x$ and $y$
by means of a few canonical variables. Bartlett (1938, 1941, 1947a)
suggested that the null hypothesis $H_k$, that the residual $p-k$ canonical
correlation coefficients are zero when the largest $k$ population
coefficients, corresponding to real relationships between $x$ and $y$, have
been removed, can be tested with the statistic $\{n - \tfrac{1}{2}(p+q+1)\}T_k$ where

$T_k = -\ln \prod_{i=k+1}^{p} (1 - r_i^2)$.

When $H_k$ is true $\{n - \tfrac{1}{2}(p+q+1)\}T_k$ is approximately distributed as $\chi^2$
with $(p-k)(q-k)$ df. If $H_k$ is accepted then the canonical variables
corresponding to $\rho_{k+1},\ldots,\rho_p$ have no predictive value and the relationship
between $x$ and $y$ may be summarized by means of the first $k$
canonical variables. In this test the $k$ largest population correlation
coefficients are nuisance parameters. Lawley (1959) examined their effect
on $T_k$ by a complicated expansion of the matrices involved in the
calculation of the $r_i$'s. He concluded that for large $n$

$\left\{ n - k - \tfrac{1}{2}(p+q+1) + \sum_{i=1}^{k} \rho_i^{-2} \right\} T_k$

is approximately $\chi^2$ with $(p-k)(q-k)$ df. The correction factor was
chosen so that the resulting statistic has the same moments up to order
$N^{-2}$ as $\chi^2$ with $(p-k)(q-k)$ df. Since in practice one does not know
the values of the $k$ largest population canonical correlation coefficients,
Lawley suggested that the statistic

$\left\{ n - k - \tfrac{1}{2}(p+q+1) + \sum_{i=1}^{k} r_i^{-2} \right\} T_k$

should be used. Little appears to be known about the accuracy of such an
approximation.
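The Bartlett and Lawley statistics described in this section are simple functions of the squared sample canonical correlations. The sketch below is only an illustration of those formulas: the function names are hypothetical, `r2` is assumed to hold $r_1^2 \ge \cdots \ge r_p^2$ in decreasing order, and `n` is assumed to be $N - 1$.

```python
import math


def residual_T(r2, k):
    """T_k = -ln prod_{i>k} (1 - r_i^2), using the p - k smallest roots."""
    return -sum(math.log(1.0 - v) for v in r2[k:])


def bartlett_statistic(r2, n, q, k):
    """Bartlett's statistic {n - (p + q + 1)/2} T_k."""
    p = len(r2)
    return (n - 0.5 * (p + q + 1)) * residual_T(r2, k)


def lawley_statistic(r2, n, q, k):
    """Lawley's statistic {n - k - (p + q + 1)/2 + sum_{i<=k} r_i^{-2}} T_k."""
    p = len(r2)
    correction = sum(1.0 / v for v in r2[:k])
    return (n - k - 0.5 * (p + q + 1) + correction) * residual_T(r2, k)


def degrees_of_freedom(p, q, k):
    """Degrees of freedom (p - k)(q - k) of the chi-square reference law."""
    return (p - k) * (q - k)
```

For instance, with `r2 = [0.64, 0.25, 0.04]`, `n = 100`, `q = 4`, and `k = 1`, Bartlett's multiplying factor is 96 while Lawley's is 96.5625, and both statistics are referred to $\chi^2$ on 6 df. With `k = 0` the two statistics coincide, since there are no nuisance coefficients to correct for.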


1.4.  Multivariate Analysis of Variance and Discriminant Analysis.

Let $x^1,\ldots,x^\ell$ be $\ell = n_1 + 1$ independent $p$-variate ($n_1 \ge p$) normally
distributed random vectors such that $x^i$ has mean $\mu^i$ and covariance
matrix $\Sigma$, $i = 1,\ldots,\ell$. Suppose one wants to test the null hypothesis

$H_0 : \mu^1 = \mu^2 = \cdots = \mu^\ell$

of the equality of the $\ell$ mean vectors on the basis of $\ell$ independent
random samples of sizes $m_i$ ($i = 1,2,\ldots,\ell$) from the $\ell$ populations.
Define $M = m_1 + m_2 + \cdots + m_\ell$. Let $X_i$ be the $p \times m_i$ matrix of the
$m_i$ sample observations from $x^i$. The corrected s.s. and s.p. matrix
$S_i$ of $X_i$ is

$S_i = X_i (I_{m_i} - m_i^{-1} E_{m_i m_i}) X_i', \qquad i = 1,\ldots,\ell,$

where $E_{ab}$ is an $a \times b$ matrix of ones. The vector of sample means of
observations from $x^i$ is

$\bar{x}^i = m_i^{-1} X_i E_{m_i 1}, \qquad i = 1,\ldots,\ell.$

The "within groups" matrix of s.s. and s.p. $W$ is defined as
$W = S_1 + \cdots + S_\ell$. $W$ has a Wishart distribution on $n_2 = M - \ell$ df and
associated matrix $\Sigma$ (denoted as $W_p(n_2, \Sigma)$). The "between groups"
matrix of s.s. and s.p. $B$ is defined as

$B = \sum_{i=1}^{\ell} m_i (\bar{x}^i - \bar{x})(\bar{x}^i - \bar{x})'$

where

$\bar{x} = M^{-1} \sum_{i=1}^{\ell} m_i \bar{x}^i$.

$B$ has a noncentral Wishart distribution on $n_1$ df with associated matrix
$\Sigma$ and noncentrality matrix $\Sigma^{-1}A$ (denoted as $W_p(n_1, \Sigma, \Sigma^{-1}A)$), where

$A = \sum_{i=1}^{\ell} m_i (\mu^i - \bar{\mu})(\mu^i - \bar{\mu})'$

and

$\bar{\mu} = M^{-1} \sum_{i=1}^{\ell} m_i \mu^i$.

$H_0$ is equivalent to the hypothesis that $A = 0$. The likelihood
ratio statistic for testing $H_0$ (Wilks (1932)) is

$\det(I - B(B+W)^{-1}) = \prod_{i=1}^{p} (1 - l_i)$

where $1 \ge l_1 \ge l_2 \ge \cdots \ge l_p \ge 0$ are the latent roots of $B(B+W)^{-1}$.
Note that with probability 1, $1 > l_1 > l_2 > \cdots > l_p > 0$.
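Fisher (1939), Hsu (1939), and Roy (1939) found the joint null
distribution, defined by $A = 0$, of $l_1,\ldots,l_p$. Constantine (1963)
derived the exact joint density. Let $L = \mathrm{diag}(l_1,\ldots,l_p)$ and
$\Omega = \mathrm{diag}(\omega_1,\ldots,\omega_p)$ where $\omega_1 \ge \omega_2 \ge \cdots \ge \omega_p \ge 0$ are the latent roots
of $\Sigma^{-1}A$. The joint pdf of the $l_i$'s, $f(L)$, is

(1.4.1)  $f(L) = \mathrm{etr}(-\tfrac{1}{2}\Omega)\ {}_1F_1^{(p)}\left(\tfrac{1}{2}(n_1+n_2); \tfrac{1}{2}n_1; \tfrac{1}{2}\Omega, L\right) \times C_1 \prod_{i=1}^{p} \left\{ l_i^{\frac{1}{2}(n_1-p-1)} (1-l_i)^{\frac{1}{2}(n_2-p-1)} \right\} \prod_{i<j} (l_i - l_j) \prod_{i=1}^{p} dl_i,$

$1 \ge l_1 \ge l_2 \ge \cdots \ge l_p \ge 0,$

where

$C_1 = \frac{\pi^{\frac{1}{2}p^2}\, \Gamma_p(\tfrac{1}{2}(n_1+n_2))}{\Gamma_p(\tfrac{1}{2}n_1)\, \Gamma_p(\tfrac{1}{2}n_2)\, \Gamma_p(\tfrac{1}{2}p)}$.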
Asymptotic approximations to the distribution of $L$ (or functions
of $L$) have been developed for either large $n_2$ (large error df) or for
large $\Omega$ (some or all of the $\omega_i$'s large). If $A \ne 0$ then in most
situations as $n_2$, and hence $M$, become large, $A$, and hence $\Omega$,
will become large. In particular, if the sample sizes $m_1,\ldots,m_\ell$ are
proportional to $M$, then $A$ will be proportional to $M$. This means
that a reasonable asymptotic approach is to let $\Omega = n_2\Theta$ where
$\Theta = \mathrm{diag}(\theta_1,\ldots,\theta_p)$ and let $n_2 \to \infty$. Hsu (1941a) found the limiting
joint pdf as $n_2 \to \infty$ with $\Omega = n_2\Theta$ of $z_i$, defined by

$z_i = \mu_i^{-1}\, n_2^{1/2} \left\{ l_i (1-l_i)^{-1} - \theta_i \right\}$  if  $\theta_i > 0$,

where $\mu_i = (2\theta_i)^{1/2} (\theta_i + 2)^{1/2}$, and

$z_i = n_2\, l_i (1-l_i)^{-1}$  if  $\theta_i = 0$.
i 
Asymptotically the $z_i$'s corresponding to different $\theta_i$'s are independent.
The $z_i$'s corresponding to equal nonzero $\theta_i$'s have a joint "normal type"
limiting distribution; the $z_i$'s corresponding to $\theta_i = 0$ have a joint
"chi-square type" distribution. Fujikoshi (1976) extended Hsu's results using
a perturbation argument. Chattopadhyay and Pillai (1973) and
Chattopadhyay, Pillai, and Li (1976) found an asymptotic expansion as
$n_2 \to \infty$ of the ${}_1F_1$ hypergeometric function in (1.4.1). Unfortunately
their result involves another ${}_1F_1$ function with the matrix $\tfrac{1}{2}L\Omega$ as
argument. Constantine and Muirhead (1976) found an asymptotic expansion
of the ${}_1F_1$ function in (1.4.1) when some or all of the $\omega_i$ are large.

In a typical multivariate analysis of variance situation one would
usually be interested in testing $H_0$, at least as a first step. $H_0$ is
equivalent to the null hypothesis that $\Omega = 0$. The likelihood ratio
statistic for testing $H_0$ is defined through

$T_0 = \prod_{i=1}^{p} (1 - l_i)$.

Bartlett (1938) showed that $-\{n_2 + \tfrac{1}{2}(n_1-p-1)\} \ln T_0$ is approximately
distributed as $\chi^2$ with $pn_1$ df. The factor $n_2 + \tfrac{1}{2}(n_1-p-1)$ was chosen
so that the resulting statistic has the same moments up to order $M^{-2}$ as
$\chi^2$ with $pn_1$ df. An asymptotic expansion for the distribution function
of Bartlett's statistic has been given by Box (1949) and Anderson (1958,
Chapter 9).


If $H_0$ is rejected then the population means are not all equal and
hence it is reasonable to look for linear functions which best discriminate
among the populations. The number of meaningful discriminant functions is
equal to the dimension of the subspace spanned by the population means,
or equivalently, to the rank of the noncentrality matrix which is equal
to the number of nonzero $\omega_i$. Bartlett (1947a) suggested that the null
hypothesis $H_k$, that the population means lie in a subspace of dimension
$k$, can be tested with the statistic

$-\{n_2 + \tfrac{1}{2}(n_1-p-1)\} \ln \prod_{i=k+1}^{p} (1 - l_i)$.

When $H_k$ is true this statistic is approximately distributed as $\chi^2$ with
$(p-k)(n_1-k)$ df. $H_k$ is equivalent to the hypothesis that the residual
$p - k$ latent roots of the noncentrality matrix are zero given that the
largest $k$ latent roots are nonzero.
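As a worked illustration of the two MANOVA statistics discussed in this section, the sketch below evaluates the product $\prod (1 - l_i)$ and Bartlett's residual-root statistic from a list of latent roots of $B(B+W)^{-1}$. The function names are hypothetical, `roots` is assumed to be sorted in decreasing order, and `n1` and `n2` are assumed to be the hypothesis and error degrees of freedom.

```python
import math


def wilks_lambda(roots):
    """prod_i (1 - l_i) over all latent roots l_i of B (B + W)^{-1}."""
    value = 1.0
    for l in roots:
        value *= 1.0 - l
    return value


def bartlett_residual_statistic(roots, n1, n2, k):
    """-{n2 + (n1 - p - 1)/2} ln prod_{i>k} (1 - l_i)."""
    p = len(roots)
    log_prod = sum(math.log(1.0 - l) for l in roots[k:])
    return -(n2 + 0.5 * (n1 - p - 1)) * log_prod


def residual_df(p, n1, k):
    """Degrees of freedom (p - k)(n1 - k) of the chi-square reference law."""
    return (p - k) * (n1 - k)
```

Setting `k = 0` gives the overall test of equal means, referred to $\chi^2$ on $p\,n_1$ df. Increasing `k` one step at a time gives the sequence of tests used to choose the number of discriminant functions.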

1.5.  Summary  of  Results. 

In Chapter 2 "asymptotic expansion" and "asymptotic representation"
are defined and the technique which will be used to derive expansions is
developed.

In Chapter 3 the partial differential equations, derived by Constantine
and Muirhead (1972) for the ${}_2F_1$, ${}_1F_1$, ${}_0F_1$, ${}_1F_0$, and ${}_0F_0$ hypergeometric
functions of two $p \times p$ symmetric matrices $S$ and $T$ are generalized
to include the case where the smallest $p - k$ latent roots of $T$ are
known to be zero. The technique presented in Chapter 2 is then used to
obtain an asymptotic expansion up to and including terms of order $n^{-1}$,
of the ${}_2F_1$ function which occurs in the pdf of the squared sample
canonical correlation coefficients. The nonzero population coefficients
are assumed to be distinct.

In Chapter 4 the results of Chapter 3 are used to obtain a "beta-type"
asymptotic approximation for large $n$ of the joint pdf of the sample
canonical correlation coefficients. In contrast to the more highly
asymptotic "normal-type" approximations of Hsu (1941b) and Sugiura (1976), the
sample roots are not independent up to and including terms of order $n^{-1}$.
The "beta-type" approximation serves as a basis for the study of the
estimation of the population coefficients and for the study of the
Bartlett-Lawley tests that the residual population canonical correlation
coefficients are zero. Since the distribution of the sample coefficients
depends only on the population coefficients, the part of the distribution
involving the population coefficients may be regarded as a marginal
likelihood function. It is shown that Fisher's z-transformation applied to the
marginal maximum likelihood estimates of the population coefficients gives
unbiased estimates to order $n^{-1}$ of the corresponding transformed population
coefficients. These estimates have variance equal to $n^{-1} + O(n^{-2})$.
The Bartlett-Lawley tests are investigated using an approach which is
similar to that used by James (1969) in connection with tests of equality
of the latent roots of a covariance matrix. The largest $k$ sample
coefficients are asymptotically sufficient for the $k$ largest population
coefficients and hence the asymptotic conditional distribution of the
$p - k$ smallest sample coefficients, given the $k$ largest sample
coefficients, does not depend on any unknown parameters. The conditional
distribution is used to confirm Lawley's correction factor and provide
some information on the accuracy of the $\chi^2$ approximation.

In Chapter 5 an asymptotic expansion is obtained, up to and including
terms of order $n_2^{-1}$, for large $n_2$ and $\Sigma^{-1}A = n_2\Theta$, of the ${}_1F_1$
hypergeometric function which occurs in MANOVA and discriminant analysis. The
nonzero latent roots of the noncentrality matrix $\Sigma^{-1}A$ are assumed to be
simple.
In Chapter 6 the results of Chapter 5 are used to obtain an asymptotic
expansion of the pdf of the sample latent roots of $B(B+W)^{-1}$ for large $n_2$
and $\Sigma^{-1}A = n_2\Theta$. The same approach as in Chapter 4 is used to investigate
the tests that the residual roots of $\Theta$ are zero.

1.6.  Notation. 

1.  $R^k$ stands for $k$-dimensional Euclidean space.

2.  Lower case letters are used for scalars and column vectors. If
$x$ is an $m$-dimensional vector then $x_i$ ($1 \le i \le m$) are the components of $x$.

3.  $O(m)$ is the group of all $m \times m$ orthogonal matrices. If
$H \in O(m)$ then $(dH)$ is the invariant Haar measure on $O(m)$ normalized
so that $\int_{O(m)} (dH) = 1$.

4.  $V(k,m)$ is the set (Stiefel manifold) of all $m \times k$ ($m \ge k$) matrices
with orthonormal columns. If $H_1 \in V(k,m)$ then $(dH_1)$ is the invariant
measure on $V(k,m)$ normalized so that $\int_{V(k,m)} (dH_1) = 1$.

5.  If $X$ is an $m \times m$ matrix then $X > 0$ means $X$ is positive
definite.

6.  If $X = (x_{ij})$ is an $m \times m$ matrix then $(dX)$ stands for the
exterior product of the elements of $X$. In particular, if $X$ is
symmetric then $(dX) = \bigwedge_{i \le j} dx_{ij}$ where $\bigwedge$ denotes the exterior product.

7.  $\Gamma_p(a)$ is the multivariate gamma function defined by (1.2.4).

8.  $\mathrm{diag}(A_1,\ldots,A_k)$ is a block diagonal matrix with $A_1,\ldots,A_k$ as
diagonal elements. The $A_i$ may be either scalars or square matrices.

9.  $0$ is used to denote a matrix of zeros. The dimension of $0$
should be clear from the context.

10.  $I_m$ is the $m \times m$ identity matrix.

CHAPTER  2 

ASYMPTOTICS  --  DEFINITIONS  AND  TECHNIQUES 

2.1.  Introduction.

One  of  the  main  objectives  of  this  research  is  to  derive  asymptotic 
expansions  for  the  hypergeometric  functions  which  occur  in  the  pdf's  of 
the  latent  roots  in  canonical  correlation  analysis  and  in  discriminant 
analysis (see Chapter 1, Sections 3 and 4). In this chapter we define
"asymptotic  expansion"  as  it  applies  to  these  hypergeometric  functions 
and  develop  the  techniques  which  will  be  used  to  derive  the  expansions. 


2.2.  Framework  and  Definitions. 

Assume  that 

(i)  $A$ is a subset of $R^p$;

(ii)  $N$ is the set of positive integers; and

(iii)  $F(n,\alpha)$ is a real valued function defined on $N \times A$ ($n \in N$, $\alpha \in A$).

In  the  specific  problems  considered  in  this  research,  F is  a hypergeometric 
function  of  two  matrix  arguments,  A is  the  set  of  sample  and  population 
latent  roots,  and  n is  the  sample  size  minus  some  constant. 

The problem is to approximate $F(n,\alpha)$ for large $n$. The following
definitions have been adapted from Erdelyi (1956).

Definition 2.2.1 --

The sequence of functions $\{\phi_k(n,\alpha),\ k = 1,2,\ldots\}$ is an asymptotic
sequence as $n \to \infty$ if for each $\alpha$

$\phi_{k+1}(n,\alpha) = o\{\phi_k(n,\alpha)\}$  as  $n \to \infty$.

Definition 2.2.2 --

Let $\{\phi_k(n,\alpha),\ k = 1,2,\ldots\}$ be an asymptotic sequence. The (formal)
series $\sum_k a_k \phi_k(n,\alpha)$ is an asymptotic expansion to $K$ terms of $F(n,\alpha)$
as $n \to \infty$ if for each $\alpha$

$F(n,\alpha) = \sum_{k=1}^{K} a_k \phi_k(n,\alpha) + o\{\phi_K(n,\alpha)\}$  as  $n \to \infty$.

This is written as

$F(n,\alpha) \sim \sum_{k=1}^{K} a_k \phi_k(n,\alpha)$.

Definition 2.2.3 --

The function $\phi(n,\alpha)$ is an asymptotic representation for $F(n,\alpha)$
as $n \to \infty$ if for each $\alpha$

$F(n,\alpha) = \phi(n,\alpha) + o\{\phi(n,\alpha)\}$  as  $n \to \infty$.

This is written as $F(n,\alpha) \sim \phi(n,\alpha)$.
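A familiar scalar example may help fix the definitions. The sketch below does not use the hypergeometric functions of this research. It takes $F(n) = n! / \{(2\pi n)^{1/2} (n/e)^n\}$, the asymptotic sequence $\phi_k(n) = n^{-(k-1)}$, and the first three Stirling coefficients $1$, $1/12$, $1/288$, all of which are assumptions chosen purely for illustration, and checks numerically that the remainder after $K$ terms is small compared with $\phi_K(n)$.

```python
import math

# First three coefficients a_1, a_2, a_3 of the Stirling series
# (illustrative choice, not taken from the text).
COEFFS = [1.0, 1.0 / 12.0, 1.0 / 288.0]


def F(n):
    """F(n) = n! / (sqrt(2 pi n) (n/e)^n), computed on the log scale."""
    log_f = math.lgamma(n + 1) - (0.5 * math.log(2 * math.pi * n)
                                  + n * math.log(n) - n)
    return math.exp(log_f)


def partial_sum(n, K):
    """sum_{k=1}^{K} a_k phi_k(n) with phi_k(n) = n^{-(k-1)}."""
    return sum(COEFFS[k] * n ** (-k) for k in range(K))


def scaled_remainder(n, K):
    """{F(n) - partial_sum(n, K)} / phi_K(n); tends to 0 as n grows."""
    return (F(n) - partial_sum(n, K)) * n ** (K - 1)
```

For each $K \le 3$ the value of `scaled_remainder(n, K)` shrinks roughly like $1/n$, which is the $o\{\phi_K(n)\}$ requirement of Definition 2.2.2. Taking $K = 1$ gives the asymptotic representation $F(n) \sim 1$ in the sense of Definition 2.2.3.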

For the particular functions $F(n,\alpha)$ that we will consider, an
asymptotic expansion will be obtained in two steps. First an asymptotic
representation will be derived using an extension of Laplace's method for
obtaining the asymptotic behavior of integrals. This asymptotic
representation will then be used to derive partial differential equations
satisfied by further terms in an asymptotic expansion of $F(n,\alpha)$.


2.3.  Derivation  of  an  Asymptotic  Representation. 

Assume that for each fixed $\alpha \in A$, $F(n,\alpha)$ has an integral
representation of the form

(2.3.1)  $F(n,\alpha) = \int_D \tau(x) \{f(x,\alpha)\}^n\, dx$.

Hsu (1948) found an asymptotic representation for such integrals when $D$
is a compact subset of $R^m$ and $\tau$ and $f$ satisfy certain regularity
conditions. It is possible to show that the functions $F(n,\alpha)$ which we
will consider are asymptotically equivalent to an integral which satisfies
the conditions of Hsu's lemma and hence an asymptotic representation
$\phi(n,\alpha)$ of $F(n,\alpha)$ can be obtained by applying Hsu's lemma to this
integral. An easier approach is to reformulate Hsu's lemma, basically by
replacing the conditions on the domain of integration $D$ with additional
conditions on the function $f$. In fact we will show that on a subset $B$
of $A$, $F(n,\alpha) \sim \phi(n,\alpha)$ uniformly in $\alpha$. This means that given $\epsilon > 0$,
there exists an $n(\epsilon)$, such that $n > n(\epsilon)$ implies
$|F(n,\alpha)/\phi(n,\alpha) - 1| < \epsilon$ for all $\alpha \in B$. The result we need is given in
Theorem 2.3.1. Corollary 2.3.1 is essentially a reformulation of Hsu's
lemma.
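To see what an asymptotic representation of an integral of the form (2.3.1) looks like in the simplest setting, consider $m = 1$, $\tau(x) = 1$, $f(x) = \cos x$, and $D = [-\pi/2, \pi/2]$. The sketch below assumes the classical one-dimensional Laplace approximation $(2\pi/n)^{1/2}\, \tau(\xi)\, f(\xi)^n\, \{-(\ln f)''(\xi)\}^{-1/2}$, where $\xi$ is the maximizer of $f$. That formula is quoted as background rather than taken from the text, and the function names are illustrative. The approximation is compared with the exact value of the integral, which is available in closed form through gamma functions.

```python
import math


def laplace_approx(n, tau_at_xi, f_at_xi, curvature):
    """One-dimensional Laplace approximation to int tau(x) f(x)^n dx.

    curvature is -(d^2/dx^2) ln f evaluated at the maximizer xi.
    """
    return (math.sqrt(2 * math.pi / n) * tau_at_xi * f_at_xi ** n
            / math.sqrt(curvature))


def exact_cos_integral(n):
    """int_{-pi/2}^{pi/2} cos(x)^n dx = sqrt(pi) Gamma((n+1)/2) / Gamma(n/2 + 1)."""
    return math.sqrt(math.pi) * math.exp(
        math.lgamma((n + 1) / 2) - math.lgamma(n / 2 + 1))


def ratio(n):
    """Exact integral divided by its Laplace approximation (xi = 0)."""
    # At xi = 0: tau = 1, f = cos(0) = 1, -(ln cos)''(0) = 1.
    return exact_cos_integral(n) / laplace_approx(n, 1.0, 1.0, 1.0)
```

The ratio is about 0.886 at $n = 2$ and within $10^{-3}$ of 1 at $n = 400$, so the approximation is an asymptotic representation in the sense of Definition 2.2.3. The uniformity in $\alpha$ discussed above has no counterpart in this example, because the integrand here does not depend on a parameter.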

The following notation is used in Theorem 2.3.1 and its corollary.
$S(r,x)$ is the open sphere with center $x$ and radius $r$, i.e.,
$S(r,x) = \{y : (y-x)'(y-x) < r^2\}$. The arguments following a partial
derivative of $f$ are the values of $x$ and $\alpha$ at which the partial derivatives
are to be evaluated. For example,

$\frac{\partial f}{\partial x_i}(\xi, a)$  means  $\frac{\partial f}{\partial x_i}(x, \alpha)$

evaluated at $x = \xi$ and $\alpha = a$.

The following well known result (see, for example, Bellman (1960))
is used in the proof of Theorem 2.3.1.

Lemma 2.3.1 --

Let $\Gamma$ be an $m \times m$ real symmetric matrix with latent roots
$\gamma_1 \ge \gamma_2 \ge \cdots \ge \gamma_m$. Then

$\gamma_1 = \max_{x \ne 0} \frac{x'\Gamma x}{x'x} = \max_{x'x = 1} x'\Gamma x$

and

$\gamma_m = \min_{x \ne 0} \frac{x'\Gamma x}{x'x} = \min_{x'x = 1} x'\Gamma x$.
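Lemma 2.3.1 can be checked numerically in the $2 \times 2$ case, where the latent roots have a closed form and the unit sphere is a circle that can be scanned on a grid. The sketch below is only an illustration under those assumptions; the function names and the grid size are arbitrary choices.

```python
import math


def rayleigh(G, x):
    """Rayleigh quotient x' G x / x' x for a 2 x 2 matrix G and vector x."""
    num = sum(x[i] * G[i][j] * x[j] for i in range(2) for j in range(2))
    den = sum(v * v for v in x)
    return num / den


def latent_roots_2x2(G):
    """Latent roots (largest, smallest) of a real symmetric 2 x 2 matrix."""
    a, b, d = G[0][0], G[0][1], G[1][1]
    mean = 0.5 * (a + d)
    half_gap = math.hypot(0.5 * (a - d), b)
    return mean + half_gap, mean - half_gap


def extremes_on_unit_circle(G, steps=3600):
    """Max and min of x' G x over a grid of unit vectors x."""
    values = []
    for t in range(steps):
        angle = 2 * math.pi * t / steps
        values.append(rayleigh(G, (math.cos(angle), math.sin(angle))))
    return max(values), min(values)
```

For `G = [[2, 1], [1, 3]]` the latent roots are $2.5 \pm \sqrt{1.25}$, and the grid maximum and minimum of the quadratic form agree with them to within the grid resolution. The quotient form and the constrained form give the same values because the Rayleigh quotient does not change when $x$ is rescaled.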
Theorem 2.3.1 --

Let $D$ be a subset of $R^m$ and $A$ a subset of $R^p$. Assume that
$B \subset A$ and that $\tau$ and $f$ are real valued functions defined on $D$ and
$D \times A$, respectively, such that

( v ) there  exis-  • ar  re i k *»  and  a "enfitant  r.  such  that 

- K 

“ (x ) f f*(x,  or ) ) is  absolutely  int  tjrable  on  D and 


/ | T'x ) i f (x,  o) } ')  dx  < r 


for  all  « ■;  b ; 

(ii)  for  each  T f ' hf  o t.  nso.  ,te  nox A mum  value  at  an 
interior  -point  • (u;  of  ' an 

0 < A inf  f{  b(a),  or]  ; 
at  B 

(iii  t ere  exists  a > > such  that  for  every  u • B , S(b  ,<.»('*)) 
is  contained  in  tne  interior  of  and  x • S{ b , h, ( Ct } } implies 

f(X,u)  > C ; 

(iv)  there  exists  a > such  that  or  every  or  c B all  partial 
derivatives 


Jf  , 

Tx~  (x'u 
i 


ix  dx 


- (x,<*) 


( i , . S f i , : } ; 

2 ° 2 n 
( v } ecu."-.,  inf  v ^(a)  _and  sup  y,  («)  « a < 11 

» t B ™ a B 


■ - • the  ■ x ■ sytnetric 

I — — IT. 

metric  ..{  (a),  a)  . ;(x,  ?)  - (a..  . (x,a) ) is  defined  for  all  a <v  B and 

x <:  Si  6.  .£(»})  ns(j  ,^(a|}  by 

i c 


w . (x,  - 2_1~_  (x,  • 

ij  to.  rx.  * 1 • 


1 
(vi)  for  eve ry  e > > there  exists  a 6 •,  o such  that  for  ail  a t 5

and  i,J  , x * S{»y  >(a-)}  replies  ! *.  Ax,ct)  - * (£<a),<*]|  < c ; 

(vii)  for  every  .5  > 1 there  exists  a constant  3 , 0 < tic  l , such 

tl  : • - ' 1 . and  ;B  ; 

(viii)  0 < in  t{  ;(u;)  ; and 

aiB 

(ix)  there  exists  u •»  sue?  that  x(x)  is  continuous  cn 

U 5 ( 6 j J -)}  an  for  every  e > there  exists  a 0,  > ->  such  that 
ceJB  ’ * 

x €S(d-,i(afj)  imp  lie  j | r(x)  - t{  / ... }(  < « fore-very  a £ B . 


Then  for  large  n 

/ T(X)  (f<x, nr)  }ndx  - ii'vr.':  ''‘fff?  f ■ r t - } i V f ' a ; } C^{ r, ( or) } 

D 

a func- 
tion of  x , i„e., 

. det  (-  JL%£  (&*),<*}} 

i .1 

and  ’’a  b for  large  r.'  means  lim  p «.  1 . 

n *♦  *•  " 

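Proof --

The proof is based on Hsu's (1948) proof.

By conditions (ii) and (iv)

$\frac{\partial f}{\partial x_i}\{\xi(\alpha),\alpha\} = 0$   (i = 1, ..., m)

and the m x m symmetric matrix

$\left(-\frac{\partial^2 f}{\partial x_i \partial x_j}\{\xi(\alpha),\alpha\}\right)$

is positive definite for every $\alpha \in B$. By condition (iii) there exists a $\delta_1 > 0$ such that $f(x,\alpha) > 0$ on $S\{\delta_1, \xi(\alpha)\}$, which is contained in the interior of D. Hence for each $\alpha \in B$ we can define the function $\phi(x,\alpha)$ on $S\{\delta_1, \xi(\alpha)\}$ by $\phi(x,\alpha) = \ln f(x,\alpha)$. By (ii) and (iv) there exists a $\delta_2$, $0 < \delta_2 \le \delta_1$, such that $\phi(x,\alpha)$ is twice differentiable as a function of x on $S\{\delta_2, \xi(\alpha)\}$. In particular

$\frac{\partial \phi}{\partial x_i}\{\xi(\alpha),\alpha\} = [f\{\xi(\alpha),\alpha\}]^{-1} \frac{\partial f}{\partial x_i}\{\xi(\alpha),\alpha\} = 0$

and

$Q\{\xi(\alpha),\alpha\} = \left(-\frac{\partial^2 \phi}{\partial x_i \partial x_j}\{\xi(\alpha),\alpha\}\right) = \left(-[f\{\xi(\alpha),\alpha\}]^{-1} \frac{\partial^2 f}{\partial x_i \partial x_j}\{\xi(\alpha),\alpha\}\right)$

is positive definite. We can expand $\phi(x,\alpha)$ for $x \in S\{\delta_2, \xi(\alpha)\}$ in a Taylor series about $\xi(\alpha)$. The result is

(2.3.2)  $\phi(x,\alpha) = \phi\{\xi(\alpha),\alpha\} - \tfrac{1}{2}\{x - \xi(\alpha)\}' Q\{\eta(\alpha),\alpha\}\{x - \xi(\alpha)\}$

where $\eta(\alpha) = \xi(\alpha) + \theta(\alpha)\{x - \xi(\alpha)\}$, $0 < \theta(\alpha) < 1$.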

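First consider the integral

$I_1(\alpha) = \int_D \left[\frac{f(x,\alpha)}{f\{\xi(\alpha),\alpha\}}\right]^n dx$ .

Let $G(\alpha)$ be any open subset of $S\{\delta_2, \xi(\alpha)\}$ containing $\xi(\alpha)$. Then from (2.3.2)

(2.3.3)  $I_1(\alpha) = \int_{G(\alpha)} \exp[-\tfrac{1}{2} n \{x - \xi(\alpha)\}' Q\{\eta(\alpha),\alpha\}\{x - \xi(\alpha)\}] \, dx + \int_{D - G(\alpha)} \left[\frac{f(x,\alpha)}{f\{\xi(\alpha),\alpha\}}\right]^n dx$ .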


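$Q\{\xi(\alpha),\alpha\}$ is positive definite and from condition (iv) $q_{ij}(x,\alpha)$ (i, j = 1, ..., m) is a continuous function of x on $S\{\delta_2, \xi(\alpha)\}$. Hence

$\lim_{x \to \xi(\alpha)} \frac{\{x-\xi(\alpha)\}' Q\{\eta(\alpha),\alpha\}\{x-\xi(\alpha)\}}{\{x-\xi(\alpha)\}' Q\{\xi(\alpha),\alpha\}\{x-\xi(\alpha)\}} = \lim_{y \to 0} \frac{y' Q\{\xi(\alpha) + \theta(\alpha) y, \alpha\} y}{y' Q\{\xi(\alpha),\alpha\} y} = 1$

where $y = x - \xi(\alpha)$. This result means that

(2.3.4)  $y' Q\{\xi(\alpha) + \theta(\alpha) y, \alpha\} y = y' Q\{\xi(\alpha),\alpha\} y \, \{1 + \lambda(y,\alpha)\}$

where, for each fixed $\alpha \in B$, $\lambda(y,\alpha) \to 0$ as $y \to 0$. We want to show that $\lambda(y,\alpha) \to 0$ as $y \to 0$, uniformly in $\alpha \in B$. That is, given $\varepsilon_0 > 0$ there exists a $\delta_0 > 0$ such that $y \in S(\delta_0, 0)$ implies $|\lambda(y,\alpha)| < \varepsilon_0$ for all $\alpha \in B$. Suppose $\varepsilon_0 > 0$ is given. By Lemma 2.3.1 and condition (v)

$y' Q\{\xi(\alpha),\alpha\} y \ge \gamma_m^2(\alpha) \, y'y \ge \mu^2 \, y'y$ .

By condition (vi) there exists a $\delta_0 > 0$ such that $y \in S(\delta_0, 0)$ implies $|q_{ij}\{\xi(\alpha) + \theta(\alpha) y, \alpha\} - q_{ij}\{\xi(\alpha),\alpha\}| < \mu^2 \varepsilon_0 / m^2$ for all i, j ($1 \le i, j \le m$) and for all $\alpha \in B$. Therefore for all nonzero $y \in S(\delta_0, 0)$ and $\alpha \in B$

$|\lambda(y,\alpha)| = \frac{|y'[Q\{\xi(\alpha) + \theta(\alpha) y, \alpha\} - Q\{\xi(\alpha),\alpha\}] y|}{y' Q\{\xi(\alpha),\alpha\} y}$

$\le \frac{\sum_{i=1}^m \sum_{j=1}^m |y_i| \, |y_j| \, \mu^2 \varepsilon_0 / m^2}{\mu^2 \, y'y}$

$\le \frac{m^2 \varepsilon_0 (\max_i |y_i|)^2}{m^2 \, y'y}$

$\le \frac{\varepsilon_0 \, y'y}{y'y} = \varepsilon_0$ .

That is, $\lambda(y,\alpha) \to 0$ as $y \to 0$ uniformly in $\alpha \in B$.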

Make the transformation of variables $y = x - \xi(\alpha)$ in (2.3.3). Then

(2.3.5)  $I_1(\alpha) = \int_{F(\alpha)} \{g(y,\alpha)\}^n dy + \int_{E(\alpha) - F(\alpha)} \{h(y,\alpha)\}^n dy$

where

$g(y,\alpha) = \exp[-\tfrac{1}{2} y' Q\{\xi(\alpha),\alpha\} y \, \{1 + \lambda(y,\alpha)\}]$ ,

$h(y,\alpha) = \frac{f\{\xi(\alpha) + y, \alpha\}}{f\{\xi(\alpha),\alpha\}}$ ,

$F(\alpha) = \{x - \xi(\alpha) : x \in G(\alpha)\}$ and $E(\alpha) = \{x - \xi(\alpha) : x \in D\}$.

Since $Q\{\xi(\alpha),\alpha\}$ is positive definite there exists an m x m orthogonal matrix $H(\alpha)$ such that $H(\alpha)' Q\{\xi(\alpha),\alpha\} H(\alpha) = \Gamma^2(\alpha)$, where $\Gamma^2(\alpha) = \mathrm{diag}\{\gamma_1^2(\alpha), \ldots, \gamma_m^2(\alpha)\}$ with $\gamma_1^2(\alpha) \ge \gamma_2^2(\alpha) \ge \cdots \ge \gamma_m^2(\alpha) > 0$. Let

$M(\alpha) = H(\alpha) \Gamma(\alpha)^{-1}$

and

$z = M(\alpha)^{-1} y$ .

Then

$y' Q\{\xi(\alpha),\alpha\} y = z' M(\alpha)' Q\{\xi(\alpha),\alpha\} M(\alpha) z = z'z$ .

The Jacobian of this transformation $J(y \to z)$ is

$J(y \to z) = \det M(\alpha) = [\det Q\{\xi(\alpha),\alpha\}]^{-1/2} = [\Delta\{\xi(\alpha)\}]^{-1/2}$ .

Let $\varepsilon > 0$ be given. By Lemma 2.3.1 and condition (v)

(2.3.6)  $\mu^2 \, y'y \le y' Q\{\xi(\alpha),\alpha\} y = z'z \le \kappa^2 \, y'y$

for all $\alpha \in B$. Since $\lambda(y,\alpha) \to 0$ as $y \to 0$ uniformly in $\alpha \in B$ there exists a $\delta$, $0 < \delta < \delta_2$, such that $y \in S(\delta, 0)$ implies $|\lambda(y,\alpha)| < \varepsilon$ for all $\alpha \in B$. Let

$D(\delta) = \{z : |z_i| < \delta\mu/m^{1/2}, \; i = 1, 2, \ldots, m\}$

and let $C(\delta,\alpha)$ be the inverse image of $D(\delta)$ under the transformation $z = M(\alpha)^{-1} y$. Since $D(\delta)$ is open and the transformation is continuous, $C(\delta,\alpha)$ is open. If $z \in D(\delta)$ then $z'z < \delta^2\mu^2$ and so by (2.3.6), if $y \in C(\delta,\alpha)$ then $y'y < \delta^2$. That is, $C(\delta,\alpha) \subset S(\delta, 0)$. If $y \in S\{\delta\mu/(\kappa m^{1/2}), 0\}$ then $y'y < \delta^2\mu^2/(\kappa^2 m)$ and hence by (2.3.6), $z'z < \delta^2\mu^2/m$, i.e., $|z_i| < \delta\mu/m^{1/2}$, i = 1, ..., m. Therefore $S\{\delta\mu/(\kappa m^{1/2}), 0\} \subset C(\delta,\alpha)$. In summary

(2.3.7)  $S\{\delta\mu/(\kappa m^{1/2}), 0\} \subset C(\delta,\alpha) \subset S(\delta, 0)$ .

Let $\eta(z,\alpha)$ be the function defined by $\lambda(y,\alpha) = \eta(z,\alpha)$ where $z = M(\alpha)^{-1} y$. If $z \in D(\delta)$ then $y \in C(\delta,\alpha) \subset S(\delta, 0)$ and therefore $|\eta(z,\alpha)| = |\lambda(y,\alpha)| < \varepsilon$ for all $\alpha \in B$. That is, $\eta(z,\alpha) \to 0$ as $z \to 0$ uniformly in $\alpha \in B$.

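Since $\delta < \delta_2$ it follows from (2.3.5) with $F(\alpha) = C(\delta,\alpha)$ that

(2.3.8)  $I_1(\alpha) = \int_{C(\delta,\alpha)} \{g(y,\alpha)\}^n dy + \int_{E(\alpha) - C(\delta,\alpha)} \{h(y,\alpha)\}^n dy$ .

Let

$I_2(\alpha) = \int_{C(\delta,\alpha)} \{g(y,\alpha)\}^n dy$

and make the transformation of variables $z = M(\alpha)^{-1} y$ in $I_2(\alpha)$. Then

$I_2(\alpha) = [\Delta\{\xi(\alpha)\}]^{-1/2} \int_{D(\delta)} [g\{M(\alpha) z, \alpha\}]^n dz$

$= [\Delta\{\xi(\alpha)\}]^{-1/2} \int_{-d}^{d} \cdots \int_{-d}^{d} \exp[-\tfrac{1}{2} n z'z \{1 + \eta(z,\alpha)\}] \, dz$

where $d = \delta\mu/m^{1/2}$. $\delta$ was chosen so that $z \in D(\delta)$ implies $|\eta(z,\alpha)| < \varepsilon$ for all $\alpha \in B$. Therefore

$[\Delta\{\xi(\alpha)\}]^{-1/2} \prod_{k=1}^{m} \int_{-d}^{d} \exp[-\tfrac{1}{2} n z_k^2 (1+\varepsilon)] \, dz_k < I_2(\alpha) < [\Delta\{\xi(\alpha)\}]^{-1/2} \prod_{k=1}^{m} \int_{-d}^{d} \exp[-\tfrac{1}{2} n z_k^2 (1-\varepsilon)] \, dz_k$ .

The integrals in the above inequalities are, up to a multiplicative constant, integrals of normal density functions. Hence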
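$J_2(\alpha) = \int_{C(\delta,\alpha)} \tau\{\xi(\alpha) + y\} \{g(y,\alpha)\}^n dy$ .

By condition (ix) and the first mean value theorem we have

(2.3.11)  $J_2(\alpha) = \tau\{\xi(\alpha) + \zeta(\alpha)\} \int_{C(\delta,\alpha)} \{g(y,\alpha)\}^n dy$

where $\zeta(\alpha) \in C(\delta,\alpha)$. Both (2.3.10) and (2.3.11) hold for arbitrary $\delta$, $0 < \delta < \min\{\delta_2, \delta_5\}$. By condition (ix) we can choose $\delta$ small enough so that $\zeta(\alpha) \in C(\delta,\alpha)$ implies $|\tau\{\xi(\alpha) + \zeta(\alpha)\} - \tau\{\xi(\alpha)\}| < \varepsilon$ for all $\alpha \in B$. Then by (2.3.9)

(2.3.12)  $[\tau\{\xi(\alpha)\} - \varepsilon] \, [\Delta\{\xi(\alpha)\}]^{-1/2} \left\{\frac{2\pi(1 - a_n)}{n(1+\varepsilon)}\right\}^{m/2} < J_2(\alpha) < [\tau\{\xi(\alpha)\} + \varepsilon] \, [\Delta\{\xi(\alpha)\}]^{-1/2} \left\{\frac{2\pi(1 - b_n)}{n(1-\varepsilon)}\right\}^{m/2}$ .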
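By condition (i), $\tau\{\xi(\alpha) + y\} [f\{\xi(\alpha) + y, \alpha\}]^k$ is absolutely integrable over $E(\alpha)$ and

$\int_{E(\alpha)} |\tau\{\xi(\alpha) + y\} [f\{\xi(\alpha) + y, \alpha\}]^k| \, dy \le K$ .

Let $S = S\{\delta\mu/(\kappa m^{1/2}), 0\}$. Then $\tau\{\xi(\alpha) + y\} [f\{\xi(\alpha) + y, \alpha\}]^k$ is absolutely integrable over $E(\alpha) - S$ and

$\int_{E(\alpha) - S} |\tau\{\xi(\alpha) + y\} [f\{\xi(\alpha) + y, \alpha\}]^k| \, dy \le K$ .

From (2.3.7), $S \subset C(\delta,\alpha)$ and so by condition (vii) there exists a $\beta$, $0 < \beta < 1$, such that

$|h(y,\alpha)| < \beta$

for all $\alpha \in B$ and $y \in E(\alpha) - S$. Then for n > k and all $\alpha \in B$,

(2.3.13)  $\left|\int_{E(\alpha) - C(\delta,\alpha)} \tau\{\xi(\alpha) + y\} \{h(y,\alpha)\}^n dy\right|$

$\le \int_{E(\alpha) - S} |\tau\{\xi(\alpha) + y\}| \, |h(y,\alpha)|^n dy$

$\le \beta^{n-k} \int_{E(\alpha) - S} \frac{|\tau\{\xi(\alpha) + y\}| \, |f\{\xi(\alpha) + y, \alpha\}|^k}{[f\{\xi(\alpha),\alpha\}]^k} \, dy$

$\le \beta^{n-k} K \left[\inf_{\alpha \in B} f\{\xi(\alpha),\alpha\}\right]^{-k}$

where the last line follows from condition (ii). Combining (2.3.13) and (2.3.12) we have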

{2. 3.  Jk)  .kl'  1 -_i_' 


trs(»n  l_-r[  s t cr ) j - 


,Jr  + J (1  ~V ' ?ri'  J,  <a)  af  5(or)  }* 


1 ♦ f 


J - nTTTn fn/2n>‘ 
for all $\alpha \in B$, where $c_n$ is a constant multiple of $\beta^{n-k} (n/2\pi)^{m/2}$. Note that $a_n$, $b_n$ and $c_n$ do not depend on $\alpha$ and $a_n \to 0$, $b_n \to 0$, and $c_n \to 0$ as $n \to \infty$. Since $\varepsilon > 0$ was arbitrary it follows from (2.3.14) and conditions (v) and (viii) that

$J(\alpha) \sim \tau\{\xi(\alpha)\} \, [\Delta\{\xi(\alpha)\}]^{-1/2} (2\pi/n)^{m/2}$

uniformly in $\alpha \in B$, proving the theorem.

Corollary --

Let D be a subset of $R^m$ and let $\tau$ and f be real valued functions defined on D such that

(i) there exists a $k \ge 0$ such that $\tau(x)\{f(x)\}^k$ is absolutely integrable on D;

(ii) f(x) has an absolute maximum value at an interior point $\xi$ of D and $f(\xi) > 0$;

(iii) all partial derivatives

$\frac{\partial f}{\partial x_i}$  and  $\frac{\partial^2 f}{\partial x_i \partial x_j}$   (i, j = 1, ..., m)

exist and are continuous in a neighborhood of $\xi$;

(iv) for every neighborhood N of $\xi$ there exists a constant $\beta$, $0 < \beta < 1$, such that $f(x)/f(\xi) < \beta$ for all $x \in D - N$; and

(v) $\tau$ is continuous in a neighborhood of $\xi$ and $\tau(\xi) \ne 0$.

Then for large n

$\int_D \tau(x)\{f(x)\}^n dx \sim (2\pi/n)^{m/2} \{f(\xi)\}^n \, \tau(\xi) \, [\Delta(\xi)]^{-1/2}$ .

Proof --

The corollary follows from Theorem 2.3.1 by taking B to be a set consisting of 1 point.
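A one-dimensional instance makes the Corollary concrete. The example below is an illustration chosen here, not one taken from the text: take m = 1, D = (0, ∞), $\tau(x) = 1$ and $f(x) = x e^{-x}$. Then $\xi = 1$, $f(\xi) = e^{-1}$, $\Delta(\xi) = 1$, and the exact integral is $\Gamma(n+1)/n^{n+1}$, so the Corollary reduces to Stirling's formula. The function names are hypothetical.

```python
import math


def exact_log_integral(n):
    """log of the integral of (x e^{-x})^n over (0, inf) = log(Gamma(n+1) / n^(n+1))."""
    return math.lgamma(n + 1) - (n + 1) * math.log(n)


def laplace_log_approx(n):
    """log of (2 pi / n)^{1/2} f(xi)^n tau(xi) Delta(xi)^{-1/2} for xi = 1.

    Here f(xi) = e^{-1}, tau(xi) = 1 and Delta(xi) = -d^2/dx^2 (ln x - x) at x = 1, which is 1.
    """
    return 0.5 * math.log(2.0 * math.pi / n) - n


def ratio(n):
    """Exact integral divided by its asymptotic representation."""
    return math.exp(exact_log_integral(n) - laplace_log_approx(n))


ratios = {n: ratio(n) for n in (10, 100, 1000)}
```

The ratios decrease towards 1 as n grows, which is the sense of "$a \sim b$ for large n" used in Theorem 2.3.1.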
2.4.  Derivation of an Asymptotic Expansion.

In this section $F(n,\alpha)$ will denote the hypergeometric functions which occur in the pdf's of the sample roots in canonical correlation analysis and in discriminant analysis.

In principle, further terms in an asymptotic expansion of $F(n,\alpha)$ could be obtained by a more careful analysis in the proof of Theorem 2.3.1. In practice the mathematics is intractable, but it is easy to see that such an analysis would lead to an asymptotic expansion of the form $F(n,\alpha) \sim \psi(n,\alpha) G(n,\alpha)$ where

(2.4.1)  $G(n,\alpha) = 1 + \frac{P_1(\alpha)}{n} + \frac{P_2(\alpha)}{n^2} + \cdots$

and $\psi(n,\alpha)$ is the asymptotic representation of $F(n,\alpha)$ obtained from Theorem 2.3.1.

By extending the results of Constantine and Muirhead (1972) we obtain a differential equation satisfied by $F(n,\alpha)$ from which we derive a differential equation satisfied by $G(n,\alpha)$. The equation for $G(n,\alpha)$ turns out to have the form

(2.4.2)  $(n\Lambda_1 + \Lambda_2) G(n,\alpha) = h(\alpha) G(n,\alpha)$

where $\Lambda_1$ and $\Lambda_2$ are differential operators in the elements of $\alpha$ which do not depend on n. Substituting (2.4.1) into (2.4.2) and equating coefficients of like powers of $n^{-1}$ gives a recursive system of differential equations for the $P_i$:

(2.4.3)  $\Lambda_1 P_1 = h$

         $\Lambda_1 P_i + (\Lambda_2 - h) P_{i-1} = 0$ ,  i = 2, 3, ...

The system of equations is solved recursively for the $P_i$. This procedure is justified by the uniqueness of asymptotic power series, i.e. if

$0 \sim \sum_i \{\Lambda_1 P_i + (\Lambda_2 - h) P_{i-1}\} \, n^{-(i-1)}$

then $\Lambda_1 P_i + (\Lambda_2 - h) P_{i-1} = 0$.
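The recursion (2.4.3) can be read off by direct substitution. The following sketch writes $\Lambda_1$ and $\Lambda_2$ for the two operators in (2.4.2) and assumes that both annihilate constants, as purely differential operators do. That assumption is made here for the illustration and is not stated explicitly in the text.

```latex
\begin{align*}
0 &= (n\Lambda_1 + \Lambda_2 - h)\Bigl(1 + \sum_{i \ge 1} P_i \, n^{-i}\Bigr) \\
  &= \sum_{i \ge 1} \Lambda_1 P_i \, n^{-(i-1)} \; - \; h \; + \; \sum_{i \ge 1} (\Lambda_2 - h) P_i \, n^{-i} \\
  &= (\Lambda_1 P_1 - h) \; + \; \sum_{i \ge 2} \bigl\{\Lambda_1 P_i + (\Lambda_2 - h) P_{i-1}\bigr\} \, n^{-(i-1)} .
\end{align*}
```

Setting the coefficient of each power of $n^{-1}$ to zero gives the first equation of (2.4.3) from the constant term and the second from the terms with $i \ge 2$.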

It turns out that $\Lambda_1$ is a first order linear differential operator. It is well known from the theory of first order linear partial differential equations that the general solution of the equation

(2.4.4)  $\sum_{i=1}^{p} f_i \frac{\partial P}{\partial \alpha_i} = R$

when the $f_i$ are independent of P, is

(2.4.5)  $P = Q + \Psi(u_1, \ldots, u_{p-1})$

where Q is any particular solution of (2.4.4) and $\Psi$ is an arbitrary function, subject to certain regularity conditions (see Goursat (1917)). The $u_i$ are any p - 1 independent solutions of the system of ordinary differential equations given by

(2.4.6)  $\frac{d\alpha_1}{f_1} = \frac{d\alpha_2}{f_2} = \cdots = \frac{d\alpha_p}{f_p}$ .

CHAPTER  3


AN ASYMPTOTIC EXPANSION OF ${}_2F_1(\tfrac{1}{2}n, \tfrac{1}{2}n; \tfrac{1}{2}q; P^2, R^2)$ FOR LARGE n.


3.1.  Introduction.

Let $R = \mathrm{diag}(r_1, \ldots, r_p)$ be the diagonal matrix of the sample canonical correlation coefficients calculated from a sample of size N = n + 1 from a (p+q)-variate normal population ($n \ge p+q$, $q \ge p$) and let $P = \mathrm{diag}(\rho_1, \ldots, \rho_p)$ be the diagonal matrix of the population correlation coefficients. The joint pdf of $r_1^2, \ldots, r_p^2$ involves the hypergeometric function ${}_2F_1(\tfrac{1}{2}n, \tfrac{1}{2}n; \tfrac{1}{2}q; P^2, R^2)$ which will be denoted in this chapter as ${}_2F_1(n, P^2, R^2)$. Assume that $1 > r_1 > \cdots > r_p > 0$, which is true with probability 1, and that $P = \mathrm{diag}(P_1, 0)$ where $P_1 = \mathrm{diag}(\rho_1, \ldots, \rho_k)$ ($0 < k \le p$) with $1 > \rho_1 > \cdots > \rho_k > 0$.

The technique presented in Chapter 2 is used to derive an asymptotic expansion, up to and including terms of order $n^{-1}$, for ${}_2F_1(n, P^2, R^2)$. In addition the partial differential equations, derived by Constantine and Muirhead (1972), for the ${}_2F_1$, ${}_1F_1$, ${}_0F_1$, ${}_1F_0$ and ${}_0F_0$ hypergeometric functions of two symmetric matrices are generalized to include the case where one of the matrices P has the form $P = \mathrm{diag}(P_1, 0)$.


3.2.  An Asymptotic Representation of ${}_2F_1(\tfrac{1}{2}n, \tfrac{1}{2}n; \tfrac{1}{2}q; P^2, R^2)$ for Large n.

An asymptotic representation of ${}_2F_1(n, P^2, R^2)$ follows from Theorem 2.3.1 once a suitable integral representation has been found. Deriving such an integral representation and checking the conditions of Theorem 2.3.1 turns out to be a rather lengthy process.

We begin by using the results of Chapter 1, Section 2, to express ${}_2F_1(n, P^2, R^2)$ as a multiple integral. It follows from (1.2.6) that

${}_2F_1(n, P^2, R^2) = \int_{O(p)} {}_2F_1^{(p)}(\tfrac{1}{2}n, \tfrac{1}{2}n; \tfrac{1}{2}q; P H' R^2 H P) \, (dH)$ .

Partition H into two submatrices $H_1$ and $H_2$ consisting of the first k and last p - k columns, respectively. It follows from (1.2.5) and the fact that $P = \mathrm{diag}(P_1, 0)$ that

(3.2.1)  ${}_2F_1(n, P^2, R^2) = \int_{O(p)} {}_2F_1^{(k)}(\tfrac{1}{2}n, \tfrac{1}{2}n; \tfrac{1}{2}q; P_1 H_1' R^2 H_1 P_1) \, (dH)$ .

Because the integrand does not depend on $H_2$, we can integrate over $H_2$ using Lemma 3.2.1 which is due to Constantine and Muirhead (1976). Eventually ${}_2F_1(n, P^2, R^2)$ will be expressed as an integral of the form

$\int_D \tau(x)\{f(x, \rho, r)\}^n dx$

where $\rho = (\rho_1, \ldots, \rho_k)'$ and $r = (r_1, \ldots, r_p)'$. Theorem 2.3.1 requires that the maximum of $f(x, \rho, r)$, for fixed $\rho$ and r, be obtained at a unique interior point of D. It turns out that at the maximum $H_1$ is uniquely determined, but $H_2$ is only determined up to a subspace defined by the orthogonal complement of $H_1$.

Lemma 3.2.1 --

Suppose

(i) $Q_1$ is the submatrix consisting of the first k columns of $Q \in O(p)$;

(ii) $(dQ_1)$ is the invariant measure on V(k,p);

(iii) $G = G(Q_1)$ is any p x (p-k) matrix with orthonormal columns orthogonal to $Q_1$;

(iv) f(Q) is a function of Q; and

(v) $H \in O(p-k)$.

Then

$\int_{O(p)} f(Q) \, (dQ) = \int_{V(k,p)} \int_{O(p-k)} f(Q_1, GH) \, (dH) \, (dQ_1)$ .

In particular if $f(Q) = f(Q_1)$ then

$\int_{O(p)} f(Q_1) \, (dQ) = \int_{V(k,p)} f(Q_1) \, (dQ_1)$ .

Applying Lemma 3.2.1 to (3.2.1) we have

${}_2F_1(n, P^2, R^2) = \int_{V(k,p)} {}_2F_1^{(k)}(\tfrac{1}{2}n, \tfrac{1}{2}n; \tfrac{1}{2}q; P_1 H_1' R^2 H_1 P_1) \, (dH_1)$ .

For n > k - 1 we may apply the Laplace transform relation (1.2.3) twice to the integrand in the previous integral to obtain

${}_2F_1(n, P^2, R^2) = \{\Gamma_k(\tfrac{1}{2}n)\}^{-2} \int_{V(k,p)} \int_{X>0} \int_{Y>0} \mathrm{etr}(-X-Y) \det(XY)^{\frac{1}{2}(n-k-1)}$

    $\times \; {}_0F_1^{(k)}(\tfrac{1}{2}q; Y^{1/2} X^{1/2} P_1 H_1' R^2 H_1 P_1 X^{1/2} Y^{1/2}) \, (dY)(dX)(dH_1)$

where X and Y are k x k positive definite matrices. By applying Bessel's integral (1.2.7) to the ${}_0F_1$ function we get

${}_2F_1(n, P^2, R^2) = \{\Gamma_k(\tfrac{1}{2}n)\}^{-2} \int_{V(k,p)} \int_{X>0} \int_{Y>0} \int_{O(q)} \mathrm{etr}(-X-Y) \det(XY)^{\frac{1}{2}(n-k-1)}$

    $\times \; \mathrm{etr}\{2 [Y^{1/2} X^{1/2} P_1 H_1' R : 0] M_1\} \, (dM)(dY)(dX)(dH_1)$

where $M_1$ is the q x k matrix formed by the first k columns of $M \in O(q)$ and 0 in $[Y^{1/2} X^{1/2} P_1 H_1' R : 0]$ is the k x (q-p) zero matrix, so that the partitioned matrix is k x q.

The integrand does not depend on the last q - k columns of M and, for the same reason that we integrated over $H_2$ in (3.2.1), we integrate over these q - k columns using Lemma 3.2.1. The result is

(3.2.2)  ${}_2F_1(n, P^2, R^2) = \{\Gamma_k(\tfrac{1}{2}n)\}^{-2} \int_{V(k,p)} \int_{X>0} \int_{Y>0} \int_{V(k,q)} \mathrm{etr}(-X-Y) \det(XY)^{\frac{1}{2}(n-k-1)}$

    $\times \; \mathrm{etr}\{2 [Y^{1/2} X^{1/2} P_1 H_1' R : 0] M_1\} \, (dM_1)(dY)(dX)(dH_1)$ .


It would be difficult in the subsequent analysis to work directly with the positive definite matrices X and Y. However, X and Y have distinct latent roots except on sets of measure zero with respect to the measures (dX) and (dY), respectively. Therefore we can write

(3.2.3)  $X = \tfrac{1}{2} n G' V^2 G$  and  $Y = \tfrac{1}{2} n Q' U^2 Q$ ,

where $G, Q \in O(k)$, $V = \mathrm{diag}(v_1, v_2, \ldots, v_k)$ with $v_1 > v_2 > \cdots > v_k > 0$, and $U = \mathrm{diag}(u_1, u_2, \ldots, u_k)$ with $u_1 > u_2 > \cdots > u_k > 0$. The transformation between X and (V, G) defined by (3.2.3) is one to one if G is restricted to orthogonal matrices with positive elements in the first column. Since the set of $G \in O(k)$ with zeros in the first column constitutes a set of zero (dG)-measure, we may assume, without loss of generality, that the elements in the first column of G are positive. By the same argument we may also assume that the elements in the first column of Q are positive and the transformation $Y \to (U, Q)$ is one to one. Let $J\{X \to (V, G)\}$ and $J\{Y \to (U, Q)\}$ be the Jacobians of these transformations. Then from Deemer and Olkin (1951)

$J\{X \to (V, G)\} = \frac{(\tfrac{1}{2}n)^{\frac{1}{2}k(k+1)} \, 2^{2k} \, \pi^{\frac{1}{2}k^2}}{\Gamma_k(\tfrac{1}{2}k)} \det V \prod_{i<j} (v_i^2 - v_j^2)$

and

$J\{Y \to (U, Q)\} = \frac{(\tfrac{1}{2}n)^{\frac{1}{2}k(k+1)} \, 2^{2k} \, \pi^{\frac{1}{2}k^2}}{\Gamma_k(\tfrac{1}{2}k)} \det U \prod_{i<j} (u_i^2 - u_j^2)$ .


When we make this change of variables in (3.2.2), the resulting integral will involve integration with respect to (dG) and (dQ) over the subset $O_1(k)$ of O(k) consisting of orthogonal k x k matrices with positive elements in the first column. The only part of the resulting integrand which depends on G or Q is $\lambda$ where

$\lambda = \mathrm{etr}\{n [Q'UQG'VG P_1 H_1' R : 0] M_1\}$ .

Consider transformations of the form $G \to S_1 G$ and $Q \to S_2 Q$ where $S_1$ and $S_2$ are diagonal matrices with $\pm 1$ along the diagonal. If the i-th diagonal element of $S_1$ is -1 then the transformation $G \to S_1 G$ changes the sign of the elements in the i-th row of G. The same is true of $S_2$ and the transformation $Q \to S_2 Q$. $\lambda$ is invariant under these transformations and therefore the integral over $O_1(k)$ with respect to (dG) and (dQ) can be replaced by $2^{-2k}$ times the integral over O(k) with respect to (dG) and (dQ). It follows after some simplification that (3.2.2) may be expressed as

(3.2.4)  ${}_2F_1(n, P^2, R^2) = c_n \int_A \eta(y) \{g(y, \rho, r)\}^n dy$

where

$c_n = \frac{(\tfrac{1}{2}n)^{nk} \, \pi^{k^2} \, 2^{2k}}{\{\Gamma_k(\tfrac{1}{2}n) \Gamma_k(\tfrac{1}{2}k)\}^2}$ ,

$\eta(y) = \prod_{i<j} \{(v_i^2 - v_j^2)(u_i^2 - u_j^2)\} \det(UV)^{-k}$ ,

$g(y, \rho, r) = \mathrm{etr}(-\tfrac{1}{2}U^2 - \tfrac{1}{2}V^2 + [Q'UQG'VG P_1 H_1' R : 0] M_1) \det(UV)$ ,

$A = V(k,p) \times O(k) \times D_V \times O(k) \times D_U \times V(k,q)$ ,

$D_X = \{(x_1, \ldots, x_k) : x_1 > x_2 > \cdots > x_k > 0\}$ ,

$\rho = (\rho_1, \ldots, \rho_k)$ ,  $r = (r_1, \ldots, r_p)$

and y is a point in A.

Finally, consider the transformations $G \to G$, $QG' \to F$, $M_1 Q' \to E_1$ and $H_1 \to H_1$. Then $G, F \in O(k)$, $H_1 \in V(k,p)$, $E_1 \in V(k,q)$ and $(dM_1)(dQ)(dG)(dH_1) = (dE_1)(dF)(dG)(dH_1)$. Also, since tr(AB) = tr(BA), we have

$\lambda = \mathrm{etr}\{n [Q'UQG'VG P_1 H_1' R : 0] M_1\}$

$= \mathrm{etr}\{n [UQG'VG P_1 H_1' R : 0] M_1 Q'\}$

$= \mathrm{etr}\{n [UFVG P_1 H_1' R : 0] E_1\}$ .

Making this transformation of variables in (3.2.4) gives

(3.2.5)  ${}_2F_1(n, P^2, R^2) = c_n \int_A \eta(y) \{h(y, \rho, r)\}^n dy$

where

$h(y, \rho, r) = \mathrm{etr}(-\tfrac{1}{2}U^2 - \tfrac{1}{2}V^2 + [UFVG P_1 H_1' R : 0] E_1) \det(UV)$ .

We now determine the maximum value of h for fixed $P_1$ and R and show that the maximum is obtained at a finite number of interior points of A. The result is contained in the following lemma.

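Lemma 3.2.2 --

Let

(i) $P_1 = \mathrm{diag}(\rho_1, \ldots, \rho_k)$ with $1 > \rho_1 > \rho_2 > \cdots > \rho_k > 0$;

(ii) $R = \mathrm{diag}(r_1, \ldots, r_p)$ with $1 > r_1 > r_2 > \cdots > r_p > 0$ ($p \ge k$);

(iii) $U = \mathrm{diag}(u_1, \ldots, u_k)$ with $u_1 > u_2 > \cdots > u_k > 0$;

(iv) $V = \mathrm{diag}(v_1, \ldots, v_k)$ with $v_1 > v_2 > \cdots > v_k > 0$;

(v) $F = (f_{ij})$ and $G = (g_{ij})$ ($1 \le i, j \le k$) with $F, G \in O(k)$;

(vi) $E_1 = (e_{ij})$ ($1 \le i \le q$, $1 \le j \le k$) with $E_1 \in V(k,q)$;

(vii) $H_1 = (h_{ij})$ ($1 \le i \le p$, $1 \le j \le k$) with $H_1 \in V(k,p)$; and

(viii) $h = \mathrm{etr}(-\tfrac{1}{2}U^2 - \tfrac{1}{2}V^2 + [UFVG P_1 H_1' R : 0] E_1) \det(UV)$.

Then the maximum value of h for fixed $P_1$ and R is

$e^{-k} \prod_{i=1}^{k} (1 - r_i \rho_i)^{-1}$

and the maximum is obtained if and only if $u_i = v_i = (1 - \rho_i r_i)^{-1/2}$ ($1 \le i \le k$),

$G = \mathrm{diag}(\pm 1, \ldots, \pm 1)$ ,  $F = \mathrm{diag}(\pm 1, \ldots, \pm 1)$ ,

and $H_1$ and $E_1$ have diagonal elements $h_{ii} = \pm 1$ and $e_{ii} = \pm 1$ with all other elements zero, where G, F, $H_1$ and $E_1$ satisfy the following constraints

$f_{ii} \, g_{ii} \, h_{ii} \, e_{ii} = 1$   ($1 \le i \le k$) .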

Proof --

Let $h_1 = e^T$, where $T = \mathrm{tr}([UFVG P_1 H_1' R : 0] E_1)$, and $h_2 = \mathrm{etr}(-\tfrac{1}{2}U^2 - \tfrac{1}{2}V^2) \det(UV)$. Then $h = h_1 h_2$. Maximizing T is equivalent to maximizing $h_1$ since $h_1$ is a strictly increasing function of T. It follows from Theorem 2 of the appendix, that for fixed U, V, $P_1$, and R, the maximum value of T is

$\sum_{i=1}^{k} u_i v_i r_i \rho_i$

and the maximum is obtained if and only if $F = \mathrm{diag}(\pm 1, \ldots, \pm 1)$, $G = \mathrm{diag}(\pm 1, \ldots, \pm 1)$, and $H_1$ and $E_1$ have diagonal elements $\pm 1$ with all other elements zero, where F, G, $H_1$ and $E_1$ satisfy the constraints $f_{ii} g_{ii} h_{ii} e_{ii} = 1$ ($1 \le i \le k$). Note that the location of the maxima does not depend on U and V. Consequently, h can be maximized in two stages as follows:

$\max_{U, V, F, G, H_1, E_1} h = \max_{U, V} \{h_2 \, (\max_{F, G, H_1, E_1} h_1)\}$ .

The problem of maximizing h thus reduces to maximizing

$g_1(U, V) = \exp\left\{\sum_{i=1}^{k} (-\tfrac{1}{2}u_i^2 - \tfrac{1}{2}v_i^2 + u_i v_i r_i \rho_i)\right\} \prod_{i=1}^{k} u_i v_i$ .

Consider the function $g_2(x, y)$ defined for all x > 0 and y > 0 by

$g_2(x, y) = -\tfrac{1}{2}x^2 - \tfrac{1}{2}y^2 + xy\alpha + \ln x + \ln y$

with $0 < \alpha < 1$. It is easily shown that $g_2$ has an absolute maximum value of $-1 - \ln(1-\alpha)$, so that $\exp(g_2)$ has an absolute maximum value of $e^{-1}(1-\alpha)^{-1}$, obtained if and only if $x = y = (1-\alpha)^{-1/2}$.

Lemma 3.2.2 follows from this result and the fact that

$(1 - r_1\rho_1)^{-1/2} > (1 - r_2\rho_2)^{-1/2} > \cdots > (1 - r_k\rho_k)^{-1/2} > 0$ .
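The maximization of $g_2$ is easy to confirm numerically. The sketch below is only a check under assumed values: `ALPHA = 0.3` and the search grid are arbitrary choices made here. It evaluates $g_2$ at the stationary point $x = y = (1-\alpha)^{-1/2}$ and compares the result with a coarse grid search over the positive quadrant.

```python
import math


def g2(x, y, alpha):
    """The function g_2(x, y) = -x^2/2 - y^2/2 + alpha*x*y + ln x + ln y."""
    return -0.5 * x * x - 0.5 * y * y + alpha * x * y + math.log(x) + math.log(y)


def stationary_point(alpha):
    """Solve -x + alpha*y + 1/x = 0 and -y + alpha*x + 1/y = 0 with x = y."""
    s = (1.0 - alpha) ** -0.5
    return s, s


ALPHA = 0.3  # hypothetical value with 0 < alpha < 1
x0, y0 = stationary_point(ALPHA)
peak = g2(x0, y0, ALPHA)

# Coarse grid on (0, 10] x (0, 10]; no grid value should exceed the peak.
grid = [0.05 * i for i in range(1, 201)]
grid_max = max(g2(x, y, ALPHA) for x in grid for y in grid)
```

Since the Hessian of $g_2$ is negative definite everywhere on the positive quadrant when $0 < \alpha < 1$, the stationary point is the unique global maximum, and exponentiating the peak gives $e^{-1}(1-\alpha)^{-1}$.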

At each of the points at which h obtains its maximum, $f_{ii} = \pm 1$, $g_{ii} = \pm 1$, $h_{ii} = \pm 1$, and $e_{ii} = \pm 1$, and in addition $f_{ii}$, $g_{ii}$, $h_{ii}$ and $e_{ii}$ satisfy the constraint $f_{ii} g_{ii} h_{ii} e_{ii} = 1$. For fixed i, the constraint is satisfied if either $f_{ii}$, $g_{ii}$, $h_{ii}$ and $e_{ii}$ all have the same sign or exactly two of them are +1 and the other two are -1. Hence, for each fixed i there are $2 + \binom{4}{2} = 2^3$ choices of $f_{ii}$, $g_{ii}$, $h_{ii}$, and $e_{ii}$ which satisfy the constraint. Since there are k values of i, there are $2^{3k}$ points at which h obtains its maximum.
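The count of admissible sign patterns can be verified by brute-force enumeration. In the sketch below the function names are hypothetical. It lists all sign choices for $(f_{ii}, g_{ii}, h_{ii}, e_{ii})$ whose product is 1 and raises the count to the power k.

```python
from itertools import product


def admissible_sign_choices():
    """All (f, g, h, e) in {+1, -1}^4 whose product equals 1."""
    return [s for s in product((1, -1), repeat=4) if s[0] * s[1] * s[2] * s[3] == 1]


def number_of_maxima(k):
    """Number of maximizing points when the constraint holds for each of k indices."""
    return len(admissible_sign_choices()) ** k


choices = admissible_sign_choices()
all_same_sign = [s for s in choices if len(set(s)) == 1]
two_and_two = [s for s in choices if sum(s) == 0]
```

The enumeration splits into the two cases named in the text: all four signs equal, or exactly two of each sign.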

Partition O(k) into $2^k$ sets such that the signs of the diagonal elements of the matrices in any set are constant. This means that $T = (t_{ij}) \in O(k)$ and $S = (s_{ij}) \in O(k)$ are in the same set if and only if sign $t_{ii}$ = sign $s_{ii}$ ($1 \le i \le k$). This procedure ignores matrices with diagonal elements equal to zero, but they constitute a set of measure 0 with respect to the invariant measure on O(k). Now partition V(k,p) and V(k,q) into $2^k$ sets each, in a similar manner. Partitioning O(k) (corresponding to F and G), V(k,p) (corresponding to $H_1$), and V(k,q) (corresponding to $E_1$) and forming the Cartesian product

$A = V(k,p) \times O(k) \times D_V \times O(k) \times D_U \times V(k,q)$

induces a partition of A into $2^{4k}$ disjoint sets. Exactly $2^{3k}$ of these sets contain in their interior one of the points at which h obtains its maximum. The other sets can be joined to these $2^{3k}$ sets to obtain a partition of A into $2^{3k}$ disjoint sets $A_i$ ($1 \le i \le 2^{3k}$). It follows from the construction procedure that for each i, $\eta h$ is integrable over $A_i$, h has an absolute maximum of

$e^{-k} \prod_{i=1}^{k} (1 - r_i\rho_i)^{-1}$

on $A_i$, the maximum is obtained at an interior point, and $A_i$ is of the form

(3.2.6)  $A_i = M_{i1} \times M_{i2} \times D_V \times M_{i3} \times D_U \times M_{i4}$

where $M_{i1} \subset V(k,p)$, $M_{i2} \subset O(k)$, $M_{i3} \subset O(k)$, and $M_{i4} \subset V(k,q)$.

Now write (3.2.5) as the sum of $2^{3k}$ integrals, where each integral is the integral of $c_n \eta h^n$ over one of the $A_i$. The only part of the integrands which depends on F, G, $H_1$, or $E_1$ is $\lambda$, where $\lambda = \mathrm{etr}\{n [UFVG P_1 H_1' R : 0] E_1\}$. $\lambda$ is invariant under transformations of the form

$F \to F S_1$ ,  $G \to S_1 G S_2$ ,  $H_1 \to \mathrm{diag}(S_3, I_{p-k}) H_1 S_2$ ,  and  $E_1 \to \mathrm{diag}(S_3, I_{q-k}) E_1$

where $S_1$, $S_2$, and $S_3$ are k x k diagonal matrices with $\pm 1$ on the diagonal. The integral over $A_i$ ($1 \le i \le 2^{3k}$) can be transformed by an appropriate choice of $S_1$, $S_2$, and $S_3$, where $S_1$, $S_2$, and $S_3$ depend on i, to an integral of the form

(3.2.7)  $J_i = c_n \int_{\omega_i} \eta(y) \{h(y, \rho, r)\}^n dy$ ,

where $\omega_i$ is the image of $A_i$ under the transformation. $S_1$, $S_2$, and $S_3$ are chosen so that the absolute maximum of h on $\omega_i$,

$e^{-k} \prod_{i=1}^{k} (1 - \rho_i r_i)^{-1}$ ,

is obtained at the interior point $\mu(\rho, r)$ defined by

(3.2.8)  $F = G = I_k$ ,  $H_1 = \begin{pmatrix} I_k \\ 0 \end{pmatrix}$ ,  $E_1 = \begin{pmatrix} I_k \\ 0 \end{pmatrix}$ ,  $u_i = v_i = (1 - \rho_i r_i)^{-1/2}$ ,  i = 1, ..., k .

It follows from (3.2.6) that $\omega_i$ is of the form

(3.2.9)  $\omega_i = N_{i1} \times N_{i2} \times D_V \times N_{i3} \times D_U \times N_{i4}$

where $N_{i1} \subset V(k,p)$, $N_{i2} \subset O(k)$, $N_{i3} \subset O(k)$ and $N_{i4} \subset V(k,q)$. For $m \ge k$ let $V^+(k,m)$ be the set of all m x k matrices in V(k,m) with positive diagonal elements. If k = m then write $O^+(k)$ for $V^+(k,m)$. It follows from the procedure used to construct the partition of A, that $V^+(k,p) \subset N_{i1}$, $O^+(k) \subset N_{i2}$, $O^+(k) \subset N_{i3}$, and $V^+(k,q) \subset N_{i4}$. Combining these results we have

(3.2.10)  ${}_2F_1(n, P^2, R^2) = \sum_{i=1}^{2^{3k}} J_i$ .

We will now derive an asymptotic representation $\psi_i$ of $J_i$. In fact $\psi_i$ does not depend on i, i.e. $\psi_i = \psi$ for all i. It then follows from (3.2.10) that $2^{3k}\psi$ is an asymptotic representation for ${}_2F_1(n, P^2, R^2)$.

Consider a particular $J_i$. Theorem 2.3.1 is not directly applicable to the integral representation (3.2.7) because the function h depends on $F, G \in O(k)$, $H_1 \in V(k,p)$, $E_1 \in V(k,q)$ and the elements of these matrices are not independent since their columns are orthonormal. It might be possible to generalize the theorem to handle this situation, but there would probably still be difficulties in computing a quantity which would correspond to the Hessian in the present statement of the theorem. Instead we transform from F, G, $H_1$, and $E_1$ to a set of independent variables. If $G \in O(k)$ then $G'G = I_k$ and hence $\det G = \pm 1$.

Definition 3.2.1 --

$G \in O(k)$ is said to be proper if det G = 1 and improper if det G = -1.

Murnaghan (1938) proved that any proper k x k orthogonal matrix G can be expressed as G = exp(S) where

$\exp(S) = I + S + \frac{S^2}{2!} + \frac{S^3}{3!} + \cdots$

and $S = (s_{ij})$ is a k x k skew-symmetric matrix. The $\tfrac{1}{2}k(k-1)$ elements of S above the diagonal provide a parametrization of G. The mapping $G \to S$ defined by G = exp(S) is a mapping from the proper matrices in O(k) into $R^{\frac{1}{2}k(k-1)}$. The image of O(k) under this mapping is a bounded subset of $R^{\frac{1}{2}k(k-1)}$ (c.f. Murnaghan (1938)).
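The exponential parametrization can be illustrated for k = 3. The sketch below is a numerical illustration only: the skew-symmetric matrix `S`, the truncation length, and the helper names are assumptions made here. It sums the series for exp(S), then checks that the result is orthogonal with determinant 1 and that its diagonal agrees with the second-order approximation $g_{ii} \approx 1 - \tfrac{1}{2}\sum_j s_{ij}^2$.

```python
def identity(k):
    return [[1.0 if i == j else 0.0 for j in range(k)] for i in range(k)]


def mat_mul(a, b):
    k = len(a)
    return [[sum(a[i][l] * b[l][j] for l in range(k)) for j in range(k)] for i in range(k)]


def transpose(a):
    return [list(col) for col in zip(*a)]


def mat_exp(s, terms=30):
    """Truncated series I + S + S^2/2! + ... (adequate for small entries of S)."""
    k = len(s)
    total = identity(k)
    power = identity(k)  # holds S^r / r! after r iterations
    for r in range(1, terms):
        power = [[v / r for v in row] for row in mat_mul(power, s)]
        total = [[total[i][j] + power[i][j] for j in range(k)] for i in range(k)]
    return total


def det3(a):
    return (a[0][0] * (a[1][1] * a[2][2] - a[1][2] * a[2][1])
            - a[0][1] * (a[1][0] * a[2][2] - a[1][2] * a[2][0])
            + a[0][2] * (a[1][0] * a[2][1] - a[1][1] * a[2][0]))


# Hypothetical small skew-symmetric matrix: S' = -S.
S = [[0.0, 0.1, -0.2], [-0.1, 0.0, 0.05], [0.2, -0.05, 0.0]]
G = mat_exp(S)
GtG = mat_mul(transpose(G), G)
diag_approx = [1.0 - 0.5 * sum(S[i][j] ** 2 for j in range(3)) for i in range(3)]
```

The three free entries of `S` above the diagonal play the role of the $\tfrac{1}{2}k(k-1)$ independent coordinates used in the change of variables.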

Since det G is a continuous function of the elements of G and det I = 1, there exists a neighborhood $N \subset O(k)$ of I such that $G \in N$ implies G is proper. It follows from (3.2.9) and the subsequent discussion, that we can choose $N \subset O^+(k)$, $N_1 \subset V^+(k,p)$, and $N_2 \subset V^+(k,q)$ such that for all i ($1 \le i \le 2^{3k}$),

$N_1 \subset N_{i1}$ ,  $N \subset N_{i2}$ ,  $N \subset N_{i3}$ ,  and  $N_2 \subset N_{i4}$ .

In particular, if p = k, take $N_1 = N$ and if q = p = k, take $N = N_1 = N_2$. Let

(3.2.11)  $\mathcal{H} = N_1 \times N \times D_V \times N \times D_U \times N_2$ .

Then $\mathcal{H} \subset \omega_i$, all orthogonal matrices in $\mathcal{H}$ are proper, and $\mu(\rho, r)$ (defined by (3.2.8)) is an interior point of $\mathcal{H}$ for all $(\rho, r)$.

$J_i$ may be written as

(3.2.12)  $J_i = c_n (L_1 + L_{i2})$

where

(3.2.13)  $L_1 = \int_{\mathcal{H}} \eta(y) \{h(y, \rho, r)\}^n dy$

and

(3.2.14)  $L_{i2} = \int_{\omega_i - \mathcal{H}} \eta(y) \{h(y, \rho, r)\}^n dy$ .

$L_1$ and $L_{i2}$ will be treated separately. We will use Theorem 2.3.1 to derive an asymptotic representation of $L_1$ and then we will prove that $L_{i2}$ is asymptotically of lower order of magnitude than $L_1$.

Using the notation of Theorem 2.3.1, let

(3.2.15)  $A = \{(\rho, r) : 1 > \rho_1 > \rho_2 > \cdots > \rho_k > 0, \; 1 > r_1 > r_2 > \cdots > r_p > 0\}$

and for each $\varepsilon$, $0 < \varepsilon < \tfrac{1}{2}$, let $B = B(\varepsilon)$ be the set of all $(\rho, r) \in A$ such that the $\rho_i$'s differ from one another and 0 and 1 by at least $\varepsilon$ and the $r_i$'s differ from one another and 0 and 1 by at least $\varepsilon$. That is, if $\rho_0 = r_0 = 1$, $\rho_{k+1} = 0$, and $r_{p+1} = 0$, then

(3.2.16)  $B = \{(\rho, r) : \rho_i - \rho_{i+1} \ge \varepsilon$ and $r_j - r_{j+1} \ge \varepsilon$ for all i ($0 \le i \le k$) and j ($0 \le j \le p$)$\}$ .

Derivation of an Asymptotic Representation for $L_1$ --

F and G are proper in $\mathcal{H}$ and hence can be parametrized by

(3.2.17)  G = exp(S) ,  $S = (s_{ij})$

(3.2.18)  F = exp(T) ,  $T = (t_{ij})$

where S and T are k x k skew-symmetric matrices. Anderson (1965) computed the Jacobian of these transformations as

$J(G \to S) = \frac{\Gamma_k(\tfrac{1}{2}k)}{2^k \pi^{\frac{1}{2}k^2}} \left\{1 + \frac{k-2}{24} \mathrm{tr}\, S^2 + \frac{8-k}{4 \cdot 6!} \mathrm{tr}\, S^4 + \frac{5k^2 - 20k + 14}{8 \cdot 6!} (\mathrm{tr}\, S^2)^2 + O(s_{ij}^5)\right\}$

with a similar expression for $J(F \to T)$. In this section the notation $O(s_{ij}^m)$ means terms in the $s_{ij}$ which are at least of order m. For our purposes it is sufficient to note that

$J(G \to S) = \frac{\Gamma_k(\tfrac{1}{2}k)}{2^k \pi^{\frac{1}{2}k^2}} \{1 + O(s_{ij}^2)\}$

$J(F \to T) = \frac{\Gamma_k(\tfrac{1}{2}k)}{2^k \pi^{\frac{1}{2}k^2}} \{1 + O(t_{ij}^2)\}$ .


Let $[H_1 : -]$ be a p x p orthogonal matrix whose first k columns are $H_1$. James (1969) showed that a parametrization of $H_1$ is given by

(3.2.19)  $[H_1 : -] = \exp(W) = \exp \begin{pmatrix} W_{11} & W_{12} \\ -W_{12}' & 0 \end{pmatrix}$ ,  $W = (w_{ij})$ .

Similarly a parametrization of $E_1$ is given by

(3.2.20)  $[E_1 : -] = \exp(Z) = \exp \begin{pmatrix} Z_{11} & -Z_{21}' \\ Z_{21} & 0 \end{pmatrix}$ ,  $Z = (z_{ij})$ .

Here $W_{11}$ and $Z_{11}$ are k x k skew-symmetric matrices, $W_{12}$ is k x (p-k), and $Z_{21}$ is (q-k) x k. The Jacobians of these transformations are (cf. James (1969))

$J\{H_1 \to (W_{11}, W_{12})\} = \frac{\Gamma_k(\tfrac{1}{2}p)}{2^k \pi^{\frac{1}{2}kp}} \{1 + O(w_{ij}^2)\}$

$J\{E_1 \to (Z_{11}, Z_{21})\} = \frac{\Gamma_k(\tfrac{1}{2}q)}{2^k \pi^{\frac{1}{2}kq}} \{1 + O(z_{ij}^2)\}$ .

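The point $\mu(\rho, r)$ defined by (3.2.8) is mapped into the point $\xi(\rho, r)$ defined by

(3.2.21)  $S = T = W_{11} = Z_{11} = 0$ ,  $W_{12} = 0$ ,  $Z_{21} = 0$ ,  $u_i = v_i = (1 - \rho_i r_i)^{-1/2}$   ($1 \le i \le k$)

and $\mathcal{H}$ is mapped into a set D containing $\xi(\rho, r)$ in its interior. It follows from (3.2.17)-(3.2.20) that

$g_{ii} = 1 - \tfrac{1}{2} \sum_{j=1}^{k} s_{ij}^2 + O(s_{ij}^3)$   ($1 \le i \le k$)

$g_{ij} = s_{ij} + O(s_{ij}^2)$   ($1 \le i \ne j \le k$) ,  $s_{ij} = -s_{ji}$

$f_{ii} = 1 - \tfrac{1}{2} \sum_{j=1}^{k} t_{ij}^2 + O(t_{ij}^3)$   ($1 \le i \le k$)

$f_{ij} = t_{ij} + O(t_{ij}^2)$   ($1 \le i \ne j \le k$) ,  $t_{ij} = -t_{ji}$

$h_{ii} = 1 - \tfrac{1}{2} \sum_{j=1}^{p} w_{ij}^2 + O(w_{ij}^3)$   ($1 \le i \le k$)

$h_{ij} = w_{ij} + O(w_{ij}^2)$   ($1 \le i \le p$, $1 \le j \le k$, $i \ne j$) ,  $w_{ij} = -w_{ji}$

$e_{ii} = 1 - \tfrac{1}{2} \sum_{j=1}^{q} z_{ij}^2 + O(z_{ij}^3)$   ($1 \le i \le k$) ,  and

$e_{ij} = z_{ij} + O(z_{ij}^2)$   ($1 \le i \le q$, $1 \le j \le k$, $i \ne j$) ,  $z_{ij} = -z_{ji}$

where $G = (g_{ij})$, $F = (f_{ij})$, $H_1 = (h_{ij})$, and $E_1 = (e_{ij})$. It follows from these relations after some simplification that

$\mathrm{tr}([UFVG P_1 H_1' R : 0] E_1) = \sum_{i=1}^{k} u_i v_i \rho_i r_i + \phi$

where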

(5.2.22)  * . - ^ 

* (V3siri*VlSJrj,tUSlj  * ^UiVJ  pjri+UJVi°irj  ^1J  vij 

+ (u.v . p.r ,+u.v. p.r. )t . .z. . + (u.v.  p.r.+u.v.  p.r,  )s.  . w.  . 
i J J J ] n i ij  ij  i i j i J j i i ij  ij 

+ (u.v. p.r.+u.v. p.r. )s. .z. . + (u.v. p.r ,+u.v. p.r. )w.  z.,} 

i i J j j J i i ij  ij  i i i j J j j i ij  ij 

k p 2 2 

- 2 S {u.v. p.r .w. .z.  . + iu. v.  p.r.  (w.  . +z.  . )} 

i.i  j-k+l  1 11  J 1 11  1 iJ 

k q 2 

- -i  T £ u.  v.  o.  r . z . . 

. , . i ii  i ij 

l-l  j-p+l 

Combining the previous results we have

(3.2.23)  $L_1 = \int_D \tau(x)\{f(x, \rho, r)\}^n\,dx$

where

r (fcir  iii)  k „ k 2 2 

,(x»  ‘ ^kkl^ip.,,)  J/Yi1  llui  '"j  >,vi  -vj  » 


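$\times\ J(G \to S)\,J(F \to T)\,J\{H_1 \to (W_{11}, W_{12})\}\,J\{E_1 \to (Z_{11}, Z_{21})\}$,

$$f(x, \rho, r) = \exp\Big\{\sum_{i=1}^{k}\big(-\tfrac12 u_i^2 - \tfrac12 v_i^2 + u_i v_i \rho_i r_i\big) + \phi\Big\}\prod_{i=1}^{k}(u_i v_i),$$

D is the image of $\Xi$, and $x \in D$.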
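As a numerical illustration of why the point (3.2.21) is the natural centre of the expansion, the sketch below looks at a single pair $(u_i, v_i)$ of the exponent of f, with the term $\phi$ omitted (it vanishes when S, T, W and Z are all zero). The function names and the grid are assumptions made here for illustration only; the check is a spot test, not a proof.

```python
import math

def log_term(u, v, rho, r):
    """One summand of ln f: -u^2/2 - v^2/2 + rho*r*u*v + ln(u*v), for u, v > 0."""
    return -0.5 * u * u - 0.5 * v * v + rho * r * u * v + math.log(u * v)

def peak_value(rho, r):
    """Value of the summand at u = v = (1 - rho*r)^(-1/2), as in (3.2.21)."""
    u0 = (1.0 - rho * r) ** -0.5
    return log_term(u0, u0, rho, r)

def grid_max(rho, r, step=0.05, n=100):
    """Largest value of the summand over a coarse grid of (u, v) in (0, n*step]^2."""
    grid = [step * i for i in range(1, n + 1)]
    return max(log_term(u, v, rho, r) for u in grid for v in grid)
```

For $\rho_i = 0.6$, $r_i = 0.7$ the value at the stationary point agrees with the closed form $-1 - \ln(1 - \rho_i r_i)$, and no grid point exceeds it; summing over i reproduces $f\{\xi(\rho,r),\rho,r\} = e^{-k}\prod_i (1-\rho_i r_i)^{-1}$.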

An asymptotic representation of $L_1$ will now be obtained by applying
Theorem 2.3.1 to (3.2.23). We must verify that the conditions of the
theorem are satisfied. In the following discussion (i)-(ix) refer to the
corresponding conditions (i)-(ix) of the theorem. Recall that A and
$B = B(\varepsilon)$ were defined by (3.2.15) and (3.2.16), respectively. $S(\delta, x)$,
defined for all $\delta > 0$, is the open sphere with center x and radius $\delta$.

Verification of the conditions of Theorem 2.3.1 --

(i)  $\tau(x)$ and $f(x, \rho, r)$ are non-negative functions on D. If
$n > k - 1$, then from (3.2.5)

$${}_2F_1(n, P^2, R^2) = C_n \int_{\Omega} \eta(y)\{h(y, \rho, r)\}^n\,dy$$

where $C_n$ and $\eta$ are defined in (3.2.4) and h is defined in (3.2.5).
$C_n > 0$ and $\eta$ and h are non-negative on $\Omega$ and $\Omega \times A$, respectively.
Hence, for $n > k - 1$

$$C_n^{-1}\,{}_2F_1(n, P^2, R^2) \ge \int_{\Xi} \eta(y)\{h(y, \rho, r)\}^n\,dy = \int_D \tau(x)\{f(x, \rho, r)\}^n\,dx$$

where $\Xi$ is defined in (3.2.11). Therefore $\tau(x)\{f(x, \rho, r)\}^n$ is
absolutely integrable on D for all $(\rho, r) \in A$ and $n > k - 1$. James
(1968) shows that zonal polynomials on the domain of positive definite
symmetric matrices are non-negative, increasing functions of each of the
latent roots. The coefficients of the zonal polynomials in the series
representations of ${}_2F_1(n, P^2, R^2)$ are positive because $n > k - 1$ and
$q > k$ (see 1.2.1 and 1.2.2). Therefore ${}_2F_1(n, P^2, R^2)$ is a non-negative
increasing function of the $\rho_i$'s and $r_j$'s since $P^2$ and $R^2$ are
positive definite. In particular

$${}_2F_1(n, P^2, R^2) < {}_2F_1\{n, (1-\varepsilon)I_k, (1-\varepsilon)I_k\}$$

for all $(\rho, r) \in B$. Take $\Gamma_n = C_n^{-1}\,{}_2F_1\{n, (1-\varepsilon)I_k, (1-\varepsilon)I_k\}$. Then

$$\Gamma_n > \int_D |\tau(x)\{f(x, \rho, r)\}^n|\,dx$$

for all $(\rho, r) \in B$.

(ii)  We have shown that for each $(\rho, r) \in A$, $f(x, \rho, r)$ has an
absolute maximum value on D which is attained at the interior point
$\xi(\rho, r)$ defined by (3.2.21). From Lemma 3.2.2

$$\inf_B f\{\xi(\rho, r), \rho, r\} = \inf_B \Big\{e^{-k}\prod_{i=1}^{k}(1 - \rho_i r_i)^{-1}\Big\} \ge e^{-k} > 0.$$

(iii)  It follows from (3.2.11) that D is of the form

(3.2.24)  $D = K_1 \times K_2 \times D_v \times K_3 \times D_u \times K_4$

where $K_1$ is a neighborhood of $(W_{11} = 0, W_{12} = 0)$, $K_2$ is a neighborhood
of $(S = 0)$, $K_3$ is a neighborhood of $(T = 0)$, and $K_4$ is a neighborhood
of $(Z_{11} = 0, Z_{21} = 0)$. Let $x_0(\rho, r)$ be the point
$\{(1 - \rho_1 r_1)^{-1/2}, \ldots, (1 - \rho_k r_k)^{-1/2}\}$. It follows from the definition of B that
there exists a $\delta > 0$ such that $S\{\delta, x_0(\rho, r)\}$ is contained in the
interior of $D_v$ and $D_u$ for every $(\rho, r) \in B$. This fact and (3.2.24)
imply that there exists a $\delta_1$, $0 < \delta_1 \le \delta$, such that $S\{\delta_1, \xi(\rho, r)\}$
is contained in the interior of D for every $(\rho, r) \in B$. $f(x, \rho, r) > 0$
for all $x \in D$ and $(\rho, r) \in A$ by

$$\omega_{ij}(x, \rho, r) = \frac{-\partial^2 \ln f}{\partial x_i\,\partial x_j}(x, \rho, r).$$

To compute $\Omega\{\xi(\rho, r), \rho, r\}$ we need the second partial derivatives of
$\psi(x, \rho, r) = -\ln f(x, \rho, r)$ evaluated at $(\xi(\rho, r), \rho, r)$. It
follows from (3.2.22) that for fixed $(\rho, r)$, the only nonzero second
partial derivatives of $\psi$ evaluated at $\{\xi(\rho, r), \rho, r\}$ are


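$$\frac{\partial^2 \psi}{\partial u_i^2} = \frac{\partial^2 \psi}{\partial v_i^2} = 2 - \rho_i r_i \qquad (1 \le i \le k),$$

$$\frac{\partial^2 \psi}{\partial u_i\,\partial v_i} = -\rho_i r_i \qquad (1 \le i \le k),$$

$$\frac{\partial^2 \psi}{\partial s_{ij}^2} = \frac{\partial^2 \psi}{\partial t_{ij}^2} = \frac{\partial^2 \psi}{\partial w_{ij}^2} = \frac{\partial^2 \psi}{\partial z_{ij}^2} = \frac{\rho_i r_i}{1 - \rho_i r_i} + \frac{\rho_j r_j}{1 - \rho_j r_j} \qquad (1 \le i < j \le k),$$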


a% 

a2  'l* 

dt . . ds . . 
1J  xj 

dt . . dz  . . 
x.i  xj 

* { Ci- 

a2) 

P/i 

+ o.r . 
i J 

dt . . dw.  . 
xj  iJ 

{(1-°iri 

) (l-o.r 
J < 

a2  v' 

PJrj  + 

o.r. 
i i 

dsij32io  “ 

l - p.  r . 
i l 

l -o.r . 
J J 

d2')r 

P-  r4 

X j . 

P/i 

' 

'-Vi 

l_PJrJ 

A 

Vi  . 

pirj 

1-Piri 

1-PJrJ 

2 2 
d ♦ djL 

piri 

1-PjT 

i 

p.  r . + p.r. 
ix  j J 


X x'  'j  j 


(i<i<j<k)  , 


(i<i< j<k)  , 


(i<i<j<k)  , 


( i< i< jck ) , 


( l<i< j<k)  , 


( l<i<k,  k+l< j<p ) , 
a2 

•r~i 

O. 

• 

dw.  . 

dz.  . " l-o.r. 
ij  ii 

l 

o 

a # 

piri 

az.2 

ij 

1 -o.  r . 

i l 

it 

follows  from 

these 

results 

and  the 

pi’ 

r . < l - e 
J 

( i<i<k, 

i<j<P) 

that 

(l<i<k,k+l<j<p)  , 


(l<i<k,p+l<j<q)  . 


suP|u).  .(i(p,r),  j,  r}|  <-  maxL2-i:?\2{i-£)2/(e(2-s))]  - r<  ® 

B J 


By  Lemma  2.3.1 
2 , 


y ( o,  r } = max  x S(S,(  p,  r),  p,  r}x 

i » 

X XdI 


k k 

max  2 2 x x.w  U(  p,  r ),  o, r } 

X Xal  i„l  j„l  J J 


k k 

< max  2 2 |x.|  | x | | u.-.  { ^(  p,  r),  p,  r}  | 


x x=i  i=i  j=i 


< C max  { 2 | x . 


x x=i  i. 


< k2r 


2 2 

l£t  h - k f . Then 


2 , , 2 

sup  { p,  r ) < * < “ . 
B 


We  will  prove  that 


inf  Yk  (p,  r)  > o 


by computing $\Omega\{\xi(\rho, r), \rho, r\}$, the Hessian of $\psi$ for fixed $(\rho, r) \in B$
evaluated at $(\xi(\rho, r), \rho, r)$. Since, by the same argument that was used
in the proof of Theorem 2.3.1, (ii)-(iv) imply that $\Omega\{\xi(\rho, r), \rho, r\}$ is
positive definite for all $(\rho, r) \in A$, we have
(3.2.25)  A{.;(p,r)}  = det  fl!  ■>(  P,  r ),  o,  r) 

k 


n V,  (p,r) 
i-i 


2,  , 2 (k-l ) 

< Yk  ( P,  r ) h 


We  will  prove  that 


0 < x - inf  A{?(  p,  r) } 

B 


and this result together with (3.2.25) implies


inf  y ‘(p,  r)  > ;f'  (k_1  ) inf  4${p,  r))  = {k_1  1 x > 0 • 

B k B 


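The determinant of a matrix is unchanged if a pair of rows and the
corresponding pair of columns are interchanged. By a sequence of such
operations $\Delta\{\xi(\rho, r)\}$ can be reduced to the determinant of a block
diagonal matrix of the form

$$\mathrm{diag}(W_1, \ldots, W_k,\ X_{12}, X_{13}, \ldots, X_{k-1,k},\ Y_{1,k+1}, \ldots, Y_{k,k+1}, \ldots, Y_{kp},\ Z_1, \ldots, Z_k)$$

where

$$W_i = \begin{pmatrix} \dfrac{\partial^2\psi}{\partial u_i^2} & \dfrac{\partial^2\psi}{\partial u_i\,\partial v_i} \\[2mm] \dfrac{\partial^2\psi}{\partial v_i\,\partial u_i} & \dfrac{\partial^2\psi}{\partial v_i^2} \end{pmatrix} \qquad (1 \le i \le k),$$

$$X_{ij} = \begin{pmatrix}
\dfrac{\partial^2\psi}{\partial s_{ij}^2} & \dfrac{\partial^2\psi}{\partial s_{ij}\,\partial t_{ij}} & \dfrac{\partial^2\psi}{\partial s_{ij}\,\partial w_{ij}} & \dfrac{\partial^2\psi}{\partial s_{ij}\,\partial z_{ij}} \\[2mm]
\dfrac{\partial^2\psi}{\partial t_{ij}\,\partial s_{ij}} & \dfrac{\partial^2\psi}{\partial t_{ij}^2} & \dfrac{\partial^2\psi}{\partial t_{ij}\,\partial w_{ij}} & \dfrac{\partial^2\psi}{\partial t_{ij}\,\partial z_{ij}} \\[2mm]
\dfrac{\partial^2\psi}{\partial w_{ij}\,\partial s_{ij}} & \dfrac{\partial^2\psi}{\partial w_{ij}\,\partial t_{ij}} & \dfrac{\partial^2\psi}{\partial w_{ij}^2} & \dfrac{\partial^2\psi}{\partial w_{ij}\,\partial z_{ij}} \\[2mm]
\dfrac{\partial^2\psi}{\partial z_{ij}\,\partial s_{ij}} & \dfrac{\partial^2\psi}{\partial z_{ij}\,\partial t_{ij}} & \dfrac{\partial^2\psi}{\partial z_{ij}\,\partial w_{ij}} & \dfrac{\partial^2\psi}{\partial z_{ij}^2}
\end{pmatrix} \qquad (1 \le i < j \le k),$$

$$Y_{ij} = \begin{pmatrix} \dfrac{\partial^2\psi}{\partial w_{ij}^2} & \dfrac{\partial^2\psi}{\partial w_{ij}\,\partial z_{ij}} \\[2mm] \dfrac{\partial^2\psi}{\partial z_{ij}\,\partial w_{ij}} & \dfrac{\partial^2\psi}{\partial z_{ij}^2} \end{pmatrix} \qquad (1 \le i \le k,\ k+1 \le j \le p), \text{ and}$$

$$Z_i = \frac{\rho_i r_i}{1 - \rho_i r_i}\,I_{q-p}.$$

All derivatives are evaluated at the point $\{\xi(\rho, r), \rho, r\}$. We then have

$$\Delta\{\xi(\rho, r)\} = \prod_{i=1}^{k}\det W_i \prod_{i<j}^{k}\det X_{ij} \prod_{i=1}^{k}\prod_{j=k+1}^{p}\det Y_{ij} \prod_{i=1}^{k}\det Z_i.$$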

After a considerable amount of simplification $\Delta\{\xi(\rho, r)\}$ reduces to

(3.2.26)
$$\Delta\{\xi(\rho, r)\} = 2^{2k}\prod_{i=1}^{k}\big\{(1 - \rho_i r_i)^{p-q+1}(\rho_i r_i)^{q-p}\big\}
\times \prod_{i<j}^{k}\frac{(\rho_i r_i - \rho_j r_j)^2(\rho_i^2 - \rho_j^2)(r_i^2 - r_j^2)}{(1 - \rho_i r_i)^4(1 - \rho_j r_j)^4}
\times \prod_{i=1}^{k}\prod_{j=k+1}^{p}\frac{\rho_i^2(r_i^2 - r_j^2)}{(1 - \rho_i r_i)^2}.$$

It follows from (3.2.26) and the fact that $(\rho, r) \in B$ implies
$\varepsilon < \rho_i, r_j < 1 - \varepsilon$ $(1 \le i \le k,\ 1 \le j \le p)$, that $\inf_B \Delta\{\xi(\rho, r)\} = \kappa > 0$.



(vi)  The set $D_1 = D_1(\varepsilon)$ defined by $D_1(\varepsilon) = \{\xi(\rho, r) : (\rho, r) \in B\}$
is a bounded set contained in the interior of D. It follows from the
definition of $\xi(\rho, r)$ (3.2.21) and the definition of D (3.2.24), that
there exists a $\delta > 0$ such that $S\{2\delta, x\} \subset D$ for every $x \in D_1$. Let
$D_2(\varepsilon) = \{x : x \in S(\delta, \xi(\rho, r)) \text{ for some } (\rho, r) \in B(\varepsilon)\}$. Then $D_2(\varepsilon)$ is a
bounded set contained in the interior of D. Let $\mathrm{cl}(D_2 \times B)$ be the
closure of $D_2 \times B$. Since $D_2$ and B are bounded subsets of Euclidean
space, $\mathrm{cl}(D_2 \times B)$ is a closed, bounded, and hence compact, set.
$\psi(x, \rho, r) = -\ln f(x, \rho, r)$ is a continuous function on $D \times A$ and hence
$\psi(x, \rho, r)$ is uniformly continuous on $\mathrm{cl}(D_2 \times B) \subset D \times A$. In particular
for any $\varepsilon_3 > 0$ there exists a $\delta_3 > 0$ such that for all $(\rho, r) \in B$,
$S\{\delta_3, \xi(\rho, r)\} \subset \mathrm{cl}(D_2 \times B)$ and $x \in S\{\delta_3, \xi(\rho, r)\}$ implies
$|\psi(x, \rho, r) - \psi\{\xi(\rho, r), \rho, r\}| < \varepsilon_3$.

(vii)  Recall that from (3.2.23) and (3.2.13)

$$L_1 = \int_D \tau(x)\{f(x, \rho, r)\}^n\,dx = \int_{\Xi} \eta(y)\{h(y, \rho, r)\}^n\,dy$$

where $y \to x$, $\Xi \to D$, $h(y, \rho, r) \to f(x, \rho, r)$, $\eta(y) \to \tau(x)$, and
$\beta(\rho, r) \to \xi(\rho, r)$ under the mapping

$$(F, G, H_1, E_1) \to (T, S, W_{11}, W_{12}, Z_{11}, Z_{21}).$$

Let q denote this mapping in terms of the x and y parameters, i.e.,
$x = q(y)$.

Condition (vii) of Theorem 2.3.1 says

a)  for every $\delta_4 > 0$ there exists a constant $\theta$, $0 < \theta < 1$,
such that $|f(x, \rho, r)/f\{\xi(\rho, r), \rho, r\}| < \theta$ for all
$x \in D - S\{\delta_4, \xi(\rho, r)\}$ and $(\rho, r) \in B$.

We will prove the following more general result

b)  for every $\delta_4 > 0$ there exists a constant $\theta$, $0 < \theta < 1$, such
that $|h(y, \rho, r)/h\{\beta(\rho, r), \rho, r\}| < \theta$ for all $y \in \Omega_i - S\{\delta_4, \beta(\rho, r)\}$,
$(\rho, r) \in B$, and i $(1 \le i \le 2^{3k})$.

The result b) will be used to prove that $L_{i2}$, defined by (3.2.14), is
asymptotically of lower order of magnitude than $L_1$.

To prove that b) implies a), consider a fixed $\delta_4 > 0$. Since the
set $D - S\{\delta_4, \xi(\rho, r)\}$ is closed relative to D and q is a continuous
mapping from $\Xi$ onto D, it follows that $\Pi$, the inverse image of
$D - S\{\delta_4, \xi(\rho, r)\}$ under q, is closed relative to $\Xi$. Therefore
$\Xi - \Pi$ is open relative to $\Xi$ and, since $q\{\beta(\rho, r)\} = \xi(\rho, r)$, $\beta(\rho, r) \in \Xi - \Pi$.
Hence, there exists a $\delta > 0$ such that $S\{\delta, \beta(\rho, r)\} \cap \Xi \subset \Xi - \Pi$. From
b) there exists a constant $\theta$, $0 < \theta < 1$, such that
$|h(y, \rho, r)/h\{\beta(\rho, r), \rho, r\}| < \theta$ for all $y \in \Xi - S\{\delta, \beta(\rho, r)\}$ and $(\rho, r) \in B$.
By definition $f(x, \rho, r) = h(q^{-1}(x), \rho, r)$ and $\Pi \subset \Xi - S\{\delta, \beta(\rho, r)\}$. Therefore
$|f(x, \rho, r)/f\{\xi(\rho, r), \rho, r\}| < \theta$ for all $x \in D - S\{\delta_4, \xi(\rho, r)\}$ and
$(\rho, r) \in B$.


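$\Omega_i$ was defined in (3.2.9) as

$$\Omega_i = N_{i1} \times N_{i2} \times D_v \times N_{i3} \times D_u \times N_{i4}$$

where $N_{i1} \subset V(k, p)$, $N_{i2} \subset O(k)$, $N_{i3} \subset O(k)$, and $N_{i4} \subset V(k, q)$ are
neighborhoods of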

‘k 

0 


Tk  ’ Tk  * and 


"k 

0 


respectively. We have used y to denote an arbitrary point in $\Omega_i$, but
in fact the components of y are values of u, v, G, F, $H_1$, and
$E_1$. To emphasize this fact we will write

$$y = (H_1, G, v, F, u, E_1).$$

For each point $y = (H_1, G, v, F, u, E_1) \in \Omega_i$, let

$$y_A = \left(\begin{bmatrix} I_k \\ 0 \end{bmatrix},\ I_k,\ v,\ I_k,\ u,\ \begin{bmatrix} I_k \\ 0 \end{bmatrix}\right).$$

$\Omega_i$ is a subset of Euclidean space so for $x, y \in \Omega_i$ let $d(x, y)$ denote
the Euclidean distance between x and y, i.e., $d(x, y) = [(x - y)'(x - y)]^{1/2}$.

The following lemma is needed in the proof of b).

Lemma 3.2.3 --

For every $\delta > 0$ there exists a constant $\theta = \theta(\delta)$, $0 < \theta < 1$,
such that if $y \in \Omega_i$ for some i $(1 \le i \le 2^{3k})$ and $d(y_A, \beta(\rho, r)) > \delta$ for
some $(\rho, r) \in B$, then

$$|h(y, \rho, r)/h\{\beta(\rho, r), \rho, r\}| < \theta.$$

Proof --

It follows from the definition of $y_A$ and $\beta(\rho, r)$ that

$$d(y_A, \beta(\rho, r)) = \Big[\sum_{i=1}^{k}\big(\{u_i - (1 - \rho_i r_i)^{-1/2}\}^2 + \{v_i - (1 - \rho_i r_i)^{-1/2}\}^2\big)\Big]^{1/2}.$$

If $d(y_A, \beta(\rho, r)) > \delta$, then there exists a j $(1 \le j \le k)$ such that
either

(3.2.27)  $|u_j - (1 - \rho_j r_j)^{-1/2}| > \delta/(2k)^{1/2}$

or

(3.2.28)  $|v_j - (1 - \rho_j r_j)^{-1/2}| > \delta/(2k)^{1/2}$.

From Lemma 3.2.2 and Theorem 2 of the appendix

$$\frac{h(y, \rho, r)}{h\{\beta(\rho, r), \rho, r\}} \le \sup_{F, G, H_1, E_1}\frac{h(y, \rho, r)}{h\{\beta(\rho, r), \rho, r\}}
= \exp\sum_{i=1}^{k}\big\{-\tfrac12 u_i^2 - \tfrac12 v_i^2 + \rho_i r_i u_i v_i + \ln(u_i v_i) + 1 + \ln(1 - \rho_i r_i)\big\}
\le \exp\{g_1(u_j, v_j, \rho_j, r_j)\}$$

where $g_1(u_j, v_j, \rho_j, r_j) = -\tfrac12 u_j^2 - \tfrac12 v_j^2 + \rho_j r_j u_j v_j + \ln(u_j v_j) + 1 + \ln(1 - \rho_j r_j)$. By
Cauchy's inequality $v_j u_j \le \tfrac12(v_j^2 + u_j^2)$ and therefore
$g_1(u_j, v_j, \rho_j, r_j) \le g_2(u_j, v_j, \rho_j, r_j)$, where

$$g_2(u_j, v_j, \rho_j, r_j) = -\tfrac12 u_j^2(1 - \rho_j r_j) - \tfrac12 v_j^2(1 - \rho_j r_j) + \ln(u_j v_j) + 1 + \ln(1 - \rho_j r_j).$$

To prove the lemma it is sufficient to prove that

$$\sup_{u_j, v_j, \rho_j, r_j} g_2(u_j, v_j, \rho_j, r_j) = \nu < 0.$$

The lemma will then follow by taking $\theta = e^{\nu}$. It is also sufficient to
prove that

$$\sup_{u_j, v_j, \rho_j, r_j} g_1(u_j, v_j, \rho_j, r_j) < 0.$$

However $g_2$ is much easier to work with than $g_1$.

From (3.2.27) and (3.2.28) we have

$$u_j = (1 - \rho_j r_j)^{-1/2} + d\cos w, \qquad v_j = (1 - \rho_j r_j)^{-1/2} + d\sin w,$$

where $d \ge \delta/(2k)^{1/2}$, $0 \le w < 2\pi$, $d\cos w > -(1 - \rho_j r_j)^{-1/2}$, and
$d\sin w > -(1 - \rho_j r_j)^{-1/2}$. Make this change of variables in $g_2$. After
simplifying we have

$$g_2(u_j, v_j, \rho_j, r_j) = g_3(d, w, \rho_j, r_j)$$

where

$$g_3(d, w, \rho_j, r_j) = -\tfrac12 d^2(1 - \rho_j r_j) - d(1 - \rho_j r_j)^{1/2}(\cos w + \sin w)
+ \ln\{1 + d(1 - \rho_j r_j)^{1/2}\cos w\} + \ln\{1 + d(1 - \rho_j r_j)^{1/2}\sin w\}.$$

A straightforward exercise in calculus leads to the result that for fixed
d, $\rho_j$, and $r_j$

$$\sup_w g_3(d, w, \rho_j, r_j) = g_3(d, 0, \rho_j, r_j) = g_3(d, \tfrac12\pi, \rho_j, r_j) = g_4\{d(1 - \rho_j r_j)^{1/2}\}$$

where

$$g_4(t) = -\tfrac12 t^2 - t + \ln(1 + t), \qquad t \ge 0.$$

Since the location of the supremum of $g_3$ for fixed d, $\rho_j$, and $r_j$ does
not depend on d, $\rho_j$, and $r_j$, we have

$$\sup_{d, w, \rho_j, r_j} g_3(d, w, \rho_j, r_j) = \sup_{d(1 - \rho_j r_j)^{1/2}} g_4\{d(1 - \rho_j r_j)^{1/2}\}.$$

If $(\rho, r) \in B$ then $r_j, \rho_j < 1 - \varepsilon$ and hence, since $\varepsilon < \tfrac12$,
$(1 - \rho_j r_j)^{1/2} > \{1 - (1 - \varepsilon)^2\}^{1/2} = (2\varepsilon - \varepsilon^2)^{1/2} > \varepsilon^{1/2}$. Also $g_4(t)$ is a decreasing
function for $t > 0$ and $g_4(0) = 0$. Combining these results we have

$$\sup g_4\{d(1 - \rho_j r_j)^{1/2}\} \le g_4\{\varepsilon^{1/2}\delta/(2k)^{1/2}\} < g_4(0) = 0,$$

proving the lemma.
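The "straightforward exercise in calculus" in the proof of Lemma 3.2.3 can be spot-checked numerically. The sketch below writes $g_3$ in terms of the single quantity $t = d(1-\rho_j r_j)^{1/2}$ and compares its largest value over a grid of admissible angles w with $g_4(t)$. The function names and the grid resolution are assumptions introduced here; this is a sanity check under those assumptions, not a substitute for the argument in the text.

```python
import math

def g3(t, w):
    """g_3 expressed through t = d * (1 - rho_j * r_j) ** 0.5 and the angle w."""
    c, s = t * math.cos(w), t * math.sin(w)
    return -0.5 * t * t - (c + s) + math.log1p(c) + math.log1p(s)

def g4(t):
    """g_4(t) = -t^2/2 - t + ln(1 + t) for t >= 0."""
    return -0.5 * t * t - t + math.log1p(t)

def sup_g3_on_grid(t, steps=720):
    """Largest g_3 over a grid of angles, keeping only admissible w,
    i.e. those with 1 + t*cos(w) > 0 and 1 + t*sin(w) > 0."""
    best = -math.inf
    for i in range(steps):
        w = 2.0 * math.pi * i / steps
        if 1.0 + t * math.cos(w) > 0.0 and 1.0 + t * math.sin(w) > 0.0:
            best = max(best, g3(t, w))
    return best
```

On such a grid the maximum is attained at w = 0 (and, up to rounding, at w = π/2) and equals $g_4(t)$, while $g_4$ itself is negative and decreasing for t > 0, which is the property the lemma relies on.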

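If $(\rho, r) \in B$ then for all j $(1 \le j \le k-1)$

$$\inf_B \big|(1 - \rho_j r_j)^{-1/2} - (1 - \rho_{j+1} r_{j+1})^{-1/2}\big| \ge \big|\{1 - (2\varepsilon)^2\}^{-1/2} - (1 - \varepsilon^2)^{-1/2}\big| = \chi(\varepsilon) > 0.$$

Let $\delta_4 > 0$ be given. Without loss of generality we may assume that
$\delta_4 < \min\{\varepsilon, \tfrac12\chi(\varepsilon)\}$. Choose i $(1 \le i \le 2^{3k})$ and $(\rho, r) \in B$. If
$y \in \Omega_i - S\{\delta_4, \beta(\rho, r)\}$ then $d(y, \beta(\rho, r)) \ge \delta_4$. By the triangle inequality

$$d(y, \beta(\rho, r)) \le d(y, y_A) + d(y_A, \beta(\rho, r))$$

and thus $d(y, \beta(\rho, r)) \ge \delta_4$ implies $d(y_A, \beta(\rho, r)) \ge \tfrac12\delta_4$ or $d(y_A, y) \ge \tfrac12\delta_4$.
If $d(y_A, \beta(\rho, r)) \ge \tfrac12\delta_4$ then it follows from Lemma 3.2.3 with $\delta = \tfrac12\delta_4$,
that there exists a $\theta_1 = \theta_1(\delta_4)$, $0 < \theta_1 < 1$, such that
$|h(y, \rho, r)/h\{\beta(\rho, r), \rho, r\}| < \theta_1$. Therefore we need only consider those y
such that $d(y_A, \beta(\rho, r)) < \tfrac12\delta_4$ and $d(y_A, y) \ge \tfrac12\delta_4$.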


The sets $O(k)$, $V(k, p)$, and $V(k, q)$ are bounded subsets of
$R^{\frac12 k(k-1)}$, $R^{\frac12 k(2p-k-1)}$, and $R^{\frac12 k(2q-k-1)}$, respectively. Consequently
$N_{i1}$, $N_{i2}$, $N_{i3}$, and $N_{i4}$, which occur in the definition of $\Omega_i$, are
bounded. However $\Omega_i$ is not bounded because $D_u$ and $D_v$ are not
bounded. We want to show that the set of all $y \in \Omega_i$ satisfying
$d(y_A, \beta(\rho, r)) < \tfrac12\delta_4$ for some $(\rho, r) \in B$ is contained in a closed bounded
subset $\Lambda_i$ of $\Omega_i$, and furthermore, that the $u_j$'s and $v_j$'s of the y's
in $\Lambda_i$ are bounded away from one another and from 0 and 1. The properties
of $\Lambda_i$ will be developed in the following discussion.

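If $(\rho, r) \in B$ then for all j $(1 \le j \le k)$

$$1 < (1 - \rho_j r_j)^{-1/2} \le \{1 - (1 - \varepsilon)^2\}^{-1/2} < \varepsilon^{-1/2}.$$

At the point $\beta(\rho, r)$, $u_j = v_j = (1 - \rho_j r_j)^{-1/2}$ $(1 \le j \le k)$. Hence
$d(y_A, \beta(\rho, r)) < \tfrac12\delta_4$ implies

$$\Big[\sum_{j=1}^{k}\big(\{u_j - (1 - \rho_j r_j)^{-1/2}\}^2 + \{v_j - (1 - \rho_j r_j)^{-1/2}\}^2\big)\Big]^{1/2} < \tfrac12\delta_4$$

and thus $|u_j - (1 - \rho_j r_j)^{-1/2}| < \tfrac12\delta_4$ and $|v_j - (1 - \rho_j r_j)^{-1/2}| < \tfrac12\delta_4$ for all j.
Combining these results we have after some simplification

$$\delta_4 < \varepsilon < u_j, v_j < \varepsilon^{-1/2} + \tfrac12\delta_4 < 2\delta_4^{-1/2}$$

for all j. Also

$$2\delta_4 < \chi(\varepsilon) \le \big|(1 - \rho_j r_j)^{-1/2} - (1 - \rho_{j+1} r_{j+1})^{-1/2}\big| < \tfrac12\delta_4 + |u_j - u_{j+1}| + \tfrac12\delta_4.$$

Therefore $|u_j - u_{j+1}| > \delta_4$ $(1 \le j \le k-1)$. Similarly $|v_j - v_{j+1}| > \delta_4$
$(1 \le j \le k-1)$. Let $D_z(\delta_4)$ be the set defined by

$$D_z(\delta_4) = \{(z_1, \ldots, z_k) : 2\delta_4^{-1/2} > z_1 > z_2 > \cdots > z_k > \delta_4 \text{ and } z_j - z_{j+1} > \delta_4\ (1 \le j \le k-1)\}.$$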

It follows from the previous discussion that if $y \in \Omega_i$ and
$d(y_A, \beta(\rho, r)) < \tfrac12\delta_4$ for some $(\rho, r) \in B$, then $y \in \Lambda_i$ where

$$\Lambda_i = N_{i1} \times N_{i2} \times D_v(\delta_4) \times N_{i3} \times D_u(\delta_4) \times N_{i4}.$$

$\Lambda_i$ is bounded and $\beta(\rho, r)$ is an interior point of $\Lambda_i$ for every
$(\rho, r) \in B$. Let

$$\Phi_i = \{y \in \Omega_i : d(y, y_A) \ge \tfrac12\delta_4\}$$

and let $\Upsilon_i = \Lambda_i \cap \Phi_i$. Then $\Upsilon_i$ is bounded. $B = B(\varepsilon)$ is also bounded
and therefore $\mathrm{cl}(\Upsilon_i \times B)$, the closure of $\Upsilon_i \times B$, is closed and
bounded, and hence compact. Let $h_1(y, \rho, r)$ be the function defined on
$\mathrm{cl}(\Upsilon_i \times B)$ by

$$h_1(y, \rho, r) = \frac{h(y, \rho, r)}{h\{\beta(\rho, r), \rho, r\}}.$$

$h_1$ is non-negative and continuous. Since $(y, \rho, r) \in \mathrm{cl}(\Upsilon_i \times B)$ implies
$u_1 > u_2 > \cdots > u_k > 0$ and $v_1 > v_2 > \cdots > v_k > 0$, it follows from
Lemma 3.2.2 that $h_1 < 1$. Hence from a well known result (see for
example Goffman (1965), p. 21), there exists a $\theta(i) < 1$ such that
$h_1(y, \rho, r) \le \theta(i)$ for all $(y, \rho, r) \in \mathrm{cl}(\Upsilon_i \times B)$. Let
$\theta = \max\{\theta_1, \theta(1), \theta(2), \ldots, \theta(2^{3k})\}$. Then we have proved

Lemma 3.2.4 --

For every $\delta_4 > 0$ there exists a constant $\theta = \theta(\delta_4)$, $0 < \theta < 1$,
such that

$$\left|\frac{h(y, \rho, r)}{h\{\beta(\rho, r), \rho, r\}}\right| < \theta$$

for all $y \in \Omega_i - S\{\delta_4, \beta(\rho, r)\}$, $(\rho, r) \in B$, and i $(1 \le i \le 2^{3k})$.


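(viii)  From (3.2.23) we have

(3.2.29)  $$\tau\{\xi(\rho, r)\} = C_1\prod_{i=1}^{k}(1 - \rho_i r_i)\prod_{i<j}^{k}\frac{\rho_i r_i - \rho_j r_j}{(1 - \rho_i r_i)(1 - \rho_j r_j)}$$

where

$$C_1 = \frac{\{\Gamma_k(\tfrac12 k)\}^2\,\Gamma_k(\tfrac12 p)\,\Gamma_k(\tfrac12 q)}{2^{4k}\,\pi^{k[k + \frac12(p+q)]}}.$$

Since $(\rho, r) \in B$ implies $\varepsilon < \rho_i, r_i < 1 - \varepsilon$ $(1 \le i \le k)$,

(3.2.30)  $$\inf_B \tau\{\xi(\rho, r)\} \ge C_1\prod_{i=1}^{k}\{1 - (1 - \varepsilon)^2\}\prod_{i<j}^{k}\frac{\varepsilon^2}{(1 - \varepsilon^2)^2} > 0.$$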

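(ix)  Let $D_2(\varepsilon) = \{x : x \in S(\delta, \xi(\rho, r))\}$ be the subset of D defined
in (vi). $D_2(\varepsilon)$ is bounded and therefore $\mathrm{cl}\{D_2(\varepsilon)\} \subset D$ is compact.
$\tau$ is continuous on D and hence uniformly continuous on $\mathrm{cl}\{D_2(\varepsilon)\}$.
In particular, for any $\varepsilon_5 > 0$ there exists a $\delta_5 > 0$ such that for all
$(\rho, r) \in B$, $S\{\delta_5, \xi(\rho, r)\} \subset \mathrm{cl}\{D_2(\varepsilon)\}$ and $x \in S\{\delta_5, \xi(\rho, r)\}$ implies
$|\tau(x) - \tau\{\xi(\rho, r)\}| < \varepsilon_5$.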

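All of the conditions of Theorem 2.3.1 are satisfied and therefore

(3.2.31)  $$L_1 \sim \tau\{\xi(\rho, r)\}\,[f\{\xi(\rho, r), \rho, r\}]^n\,[\Delta\{\xi(\rho, r)\}]^{-1/2}\,(2\pi/n)^{\frac12 k(p+q)}$$

uniformly in $(\rho, r) \in B$. Substituting (3.2.26) and (3.2.29) into
(3.2.31) and simplifying gives

(3.2.32)  $$L_1 \sim C_2\prod_{i=1}^{k}\big\{(1 - \rho_i r_i)^{-n + \frac12(p+q-1)}(\rho_i r_i)^{\frac12(p-q)}\big\}
\times \prod_{i<j}^{k}\{(\rho_i^2 - \rho_j^2)(r_i^2 - r_j^2)\}^{-1/2}\prod_{i=1}^{k}\prod_{j=k+1}^{p}\{\rho_i^2(r_i^2 - r_j^2)\}^{-1/2}$$

uniformly in $(\rho, r) \in B$, where

$$C_2 = \frac{\{\Gamma_k(\tfrac12 k)\}^2\,\Gamma_k(\tfrac12 p)\,\Gamma_k(\tfrac12 q)\,e^{-nk}}{n^{\frac12 k(p+q)}\,2^{5k - \frac12 k(p+q)}\,\pi^{k^2}}.$$

Asymptotic behavior of $L_{i2}$ --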

It follows from (3.2.14) that

$$L_{i2} = \int_{\Omega_i - \Xi} \eta(y)\{h(y, \rho, r)\}^n\,dy.$$

We can show, by exactly the same argument that was used in (i) to prove
the absolute integrability of $\tau(x)\{f(x, \rho, r)\}^n$ on D, that
$\eta(y)\{h(y, \rho, r)\}^n$ is integrable on $\Omega_i - \Xi$ for all i $(1 \le i \le 2^{3k})$ and
$n > k - 1$. Furthermore

$$\Gamma_n > \int_{\Omega_i - \Xi} \eta(y)\{h(y, \rho, r)\}^n\,dy$$

for all $n > k - 1$, $(\rho, r) \in B$, and i $(1 \le i \le 2^{3k})$, where
$\Gamma_n = C_n^{-1}\,{}_2F_1\{n, (1-\varepsilon)I_k, (1-\varepsilon)I_k\}$ and $C_n$ is defined by (3.2.4).
Integrability of $\eta h^n$ is the same as absolute integrability since $\eta$
and h are positive on $\Omega_i$ and $\Omega_i \times A$, respectively. It follows
from the definition of $\Xi$ (3.2.11) and Lemma 3.2.4, that there exists a
constant $\theta$, $0 < \theta < 1$, such that $|h(y, \rho, r)/h\{\beta(\rho, r), \rho, r\}| < \theta$ for
all $y \in \Omega_i - \Xi$, $(\rho, r) \in B$, and i $(1 \le i \le 2^{3k})$. Therefore

$$|L_{i2}| \le \int_{\Omega_i - \Xi} |\eta(y)\{h(y, \rho, r)\}^n|\,dy
= \int_{\Omega_i - \Xi} |\eta(y)\{h(y, \rho, r)\}^k|\,\left|\frac{h(y, \rho, r)}{h\{\beta(\rho, r), \rho, r\}}\right|^{n-k}|h\{\beta(\rho, r), \rho, r\}|^{n-k}\,dy$$

$$\le \theta^{n-k}\,|h\{\beta(\rho, r), \rho, r\}|^{n-k}\int_{\Omega_i - \Xi} |\eta(y)\{h(y, \rho, r)\}^k|\,dy
\le \theta^{n-k}\,\Gamma_k\,|h\{\beta(\rho, r), \rho, r\}|^{n-k}.$$

We have

(3.2.33)  $$h\{\beta(\rho, r), \rho, r\} = f\{\xi(\rho, r), \rho, r\} = e^{-k}\prod_{i=1}^{k}(1 - \rho_i r_i)^{-1}.$$

It follows from (3.2.33), (3.2.26), and (3.2.30) that

$$\sup_B\,[\tau\{\xi(\rho, r)\}]^{-1}[f\{\xi(\rho, r), \rho, r\}]^{-k}[\Delta\{\xi(\rho, r)\}]^{1/2}(2\pi)^{-\frac12 k(p+q)} = \iota < \infty.$$

Let $\tilde L_1$ denote the right-hand side of (3.2.31). Then

$$|L_{i2}/\tilde L_1| \le \theta^{n-k}\,n^{\frac12 k(p+q)}\,\Gamma_k\,\iota \to 0 \quad \text{as } n \to \infty$$

uniformly in $(\rho, r) \in B$ and i $(1 \le i \le 2^{3k})$. Hence $L_{i2}$ is asymptotically
of lower order of magnitude than $L_1$. Consequently

(3.2.34)  $L_1 + L_{i2} \sim \tilde L_1$.

Combining  (3.2.10),  (3.2.12),  and  (3.2.34),  we  have 

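Theorem 3.2.1 --

Let

i)  $R = \mathrm{diag}(r_1, \ldots, r_p)$ with $1 > r_1 > \cdots > r_p > 0$;

ii)  $P = \mathrm{diag}(\rho_1, \ldots, \rho_k)$ with $1 > \rho_1 > \cdots > \rho_k > 0$;

iii)  $A = \{(\rho, r) : 1 > \rho_1 > \cdots > \rho_k > 0,\ 1 > r_1 > \cdots > r_p > 0\}$
where $\rho = (\rho_1, \ldots, \rho_k)$ and $r = (r_1, \ldots, r_p)$; and

iv)  for every $\varepsilon$, $0 < \varepsilon < \tfrac12$,
$B = B(\varepsilon) = \{(\rho, r) \in A : \rho_i - \rho_{i+1} > \varepsilon\ (0 \le i \le k),\ r_j - r_{j+1} > \varepsilon\ (0 \le j \le p)$,
where $\rho_0 = r_0 = 1$, $\rho_{k+1} = r_{p+1} = 0\}$.

Then for all $(\rho, r) \in A$

(3.2.35)  $${}_2F_1(\tfrac12 n, \tfrac12 n; \tfrac12 q; P^2, R^2) \sim K_n\prod_{i=1}^{k}\big\{(1 - \rho_i r_i)^{-n + \frac12(q+p-1)}(\rho_i r_i)^{\frac12(p-q)}\big\}
\times \prod_{i=1}^{k}\prod_{j=i+1}^{p}\{(r_i^2 - r_j^2)(\rho_i^2 - \rho_j^2)\}^{-1/2}$$

(with $\rho_j = 0$ for $j > k$, in agreement with (3.2.32)), where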


C^n  )e-nlt  rfcl  fe.  t rfcl  fri  } 
Furthermore (3.2.35) holds uniformly in $(\rho, r) \in B$ for every $\varepsilon$.

As a check on the preceding derivation it should be noted that when
$p = k = 1$, (3.2.35) agrees with the known asymptotic behavior of the
classical hypergeometric function (see Luke (1969, Section 7.2)).


3.3.  Partial Differential Equations for Hypergeometric Functions of Two
Matrix Arguments.

In this section the partial differential equations derived by
Constantine and Muirhead (1972) are extended to include the case where
some of the latent roots of one of the matrices are known to be zero.
These results and the results of Section 3.2 will be used in the next
section to find additional terms in the asymptotic expansion of ${}_2F_1(n, P^2, R^2)$.

We introduce the following notation:

1)  S is a p x p symmetric matrix with latent roots $s_1, \ldots, s_p$;

2)  $T = \mathrm{diag}(T_1, 0)$ is a p x p symmetric matrix and $T_1$ is an
$\ell \times \ell$ symmetric matrix with latent roots $t_1, \ldots, t_\ell$;

3)  $\kappa = (k_1, \ldots, k_p)$, $k_1 \ge k_2 \ge \cdots \ge k_p \ge 0$, is a partition of
$k = \sum_{i=1}^{p} k_i$ into p parts;

4)  $\eta = (k_1, \ldots, k_\ell)$, $k_1 \ge \cdots \ge k_\ell \ge 0$, is a partition, defined by
the partition $\kappa$, of $h = \sum_{i=1}^{\ell} k_i$ into $\ell$ parts;

5)  Corresponding to the partition $\kappa$, let
$\kappa_i = (k_1, \ldots, k_{i-1}, k_i + 1, k_{i+1}, \ldots, k_p)$ and
$\kappa^{(i)} = (k_1, \ldots, k_{i-1}, k_i - 1, k_{i+1}, \ldots, k_p)$ whenever they are admissible,
i.e., whenever the elements of the partition are non-negative and in
nonincreasing order;

6)  Corresponding to the partition $\eta$, let
$\eta_i = (k_1, \ldots, k_{i-1}, k_i + 1, k_{i+1}, \ldots, k_\ell)$ and
$\eta^{(i)} = (k_1, \ldots, k_{i-1}, k_i - 1, k_{i+1}, \ldots, k_\ell)$ whenever they are admissible; and

7)  If $\eta = (k_1, \ldots, k_\ell)$ then $C_\eta(B)$ will denote the zonal polynomial
of the p x p matrix B corresponding to the partition
$(k_1, \ldots, k_\ell, 0, \ldots, 0)$.

James (1964) proved that

(3.3.1)  $$C_\kappa(T) = \begin{cases} C_\eta(T_1) & \text{if } k_{\ell+1} = \cdots = k_p = 0 \\ 0 & \text{otherwise.} \end{cases}$$
P 


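From (3.3.1) we have

(3.3.2)  $${}_uF_v(a_1, \ldots, a_u; b_1, \ldots, b_v; T, S)
= \sum_{k=0}^{\infty}\sum_{\kappa}\frac{(a_1)_\kappa\cdots(a_u)_\kappa}{(b_1)_\kappa\cdots(b_v)_\kappa}\,\frac{C_\kappa(T)\,C_\kappa(S)}{k!\,C_\kappa(I_p)}
= \sum_{h=0}^{\infty}\sum_{\eta}\frac{(a_1)_\eta\cdots(a_u)_\eta}{(b_1)_\eta\cdots(b_v)_\eta}\,\frac{C_\eta(T_1)\,C_\eta(S)}{h!\,C_\eta(I_p)}$$

where $a_1, \ldots, a_u$, $b_1, \ldots, b_v$ are real or complex constants. We need the
following differential operators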


(3.3.3) 


P 

y 


“c  - ~ i 2 

b i-i  asi 


+ y 


, - C . - S . jS . ’ 

1-1  J-l  1 j 1 

U5 


(3.3.4) 


e<5  " 2 ~ 

i-l  l 


(3.3.5) 


■ u.  A • 


and 


(3.3.6) 


Z t 


3 d‘ 


^ • u.  '*  at/ 


• - x r4rr  nr  * 

i«i  i j i 


ji 


Constantine and Muirhead (1972) prove
c (i)(S) 


(3.5.7)  &gCMS)  - Cx(Ip)  I (H(i))  [k.-l+^p-i)]  ^ 

H 

p /H  \ c„(i)(S) 

(3.3.8)  e C (S)  ~ C (I  ) 5)  (i ) 

S h k p iml  \«  / CK(i)(Ip) 

o.j.9)  Vn'V  ' iri-r  if1('nLVi-*t1-' )3° ) 

O.J.io)  - irrT  y (T,) 

i«*i  i. 


and 


(3.3.11  ) 


■ ih*' ,TriT.  >s(t>  1 


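where $\binom{\kappa}{\kappa^{(i)}}$ is the coefficient of $C_{\kappa^{(i)}}(S)/C_{\kappa^{(i)}}(I_p)$ in the "binomial
expansion"

$$\frac{C_\kappa(I_p + S)}{C_\kappa(I_p)} = \sum_{s=0}^{k}\sum_{\sigma}\binom{\kappa}{\sigma}\frac{C_\sigma(S)}{C_\sigma(I_p)}$$

and $\binom{\eta_i}{\eta}$ is the coefficient of $C_\eta(T_1)/C_\eta(I_\ell)$ in the "binomial
expansion"

$$\frac{C_{\eta_i}(I_\ell + T_1)}{C_{\eta_i}(I_\ell)} = \sum_{t=0}^{h+1}\sum_{\tau}\binom{\eta_i}{\tau}\frac{C_\tau(T_1)}{C_\tau(I_\ell)}$$

(see Muirhead (1970) for details).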

We are now in a position to extend the partial differential equations
satisfied by hypergeometric functions commonly occurring in multivariate
analysis. We start with the ${}_2F_1$ function.

Theorem 3.3.1 --

The function ${}_2F_1(a, b; c; T, S)$ satisfies the partial differential
equation

(3.3.12)  $\delta_S F + [c - \tfrac12(p-1)]\,\varepsilon_S F - [a + b - \tfrac12(\ell-1)]\,\gamma_T F - \eta_T F = ab\,\mathrm{Tr}(T_1)\,F$.

Proof --

(3.3.13)  $${}_2F_1(a, b; c; T, S) = \sum_{k=0}^{\infty}\sum_{\kappa}\frac{(a)_\kappa(b)_\kappa}{(c)_\kappa\,k!}\,\frac{C_\kappa(S)\,C_\kappa(T)}{C_\kappa(I_p)}.$$

By applying the differential operator $\delta_S$ term-by-term to the
series (3.3.13) it follows from (3.3.7) and (3.3.1) that

” (a)f|(b)j|  ^ | \ -n(i)^  ^ 

63  * ■ hfo  U»>  Ck‘-1^-i)I  r-77 

T)(i)  P 

By applying $\varepsilon_S$ it follows from (3.3.8) and (3.3.1) that


(3.3.15) 


00  (a)  (b)  l , C (i'(S' 

e F » v y :nl  ’T)  {t  i Ti  \ V1' 

3 ! 1 h.o  T|  {e'n h:  1 1 c m,<V 


Using (3.3.2) we may write the ${}_2F_1$ function as

(3.3.16)  $${}_2F_1(a, b; c; T, S) = \sum_{h=0}^{\infty}\sum_{\eta}\frac{(a)_\eta(b)_\eta}{(c)_\eta\,h!}\,\frac{C_\eta(T_1)\,C_\eta(S)}{C_\eta(I_p)}.$$

By applying the operator $\gamma_T$ to the series (3.3.16) it follows from
(3.3.9) that

(a)  (b)  C_(S)  II  Tl\ 

„3.m  vT  /,  . 

By applying $\eta_T$ to (3.3.16) it follows from (3.3.10) that
(3.3.18)  1^  2P1  - 

* (a)  (b)  C ( S ) l , T>  \ 

hfo  l (cVh+1):  "TV  i-i^  T|  J 

Let $\Delta$ denote the differential operator

$$\delta_S + [c - \tfrac12(p-1)]\,\varepsilon_S - [a + b - \tfrac12(\ell-1)]\,\gamma_T - \eta_T.$$

Combining (3.3.14)-(3.3.18) we have
C (i  (S) 

n \ tv  J 


'*’Mn  V.  ■ l ; ^ > J 

T] 


C (S)  4 ,•  T.\ 

( h+i  )C  (T  f ^ °ri  i-i  ) ] [a+b+k.  ~^(  i-l  ) J 

7 T|  P i— i ' 1 


Let C denote the coefficient of $C_\eta(S)$ in (3.3.19). Then


H\)  \(v 


(a)r,  (b) 


C - ^ 


. ' T)  (h+l)IC  (I  ) ) (c)  [ki+c-^fi-l  )] 

ini  I,  p 


(a)n(b,n 

— [k.-§(i-i  )][a+b+k.-|(i-i  )] 


(a)l|(b)T,  ab 


^ \ 

(c)'h:  ' Cri(If  )(h*l  ) T|  ' CT^(Tl> 


(a)  ,(b) 


ab  C (T  )Tr (T, ) 


(c)  n h.'C.^Ip)  ““  i|ul 


where the last line follows from (3.3.11). The theorem now follows from
(3.3.16).

Corollary 3.3.1 --

The function ${}_2F_1(a, b; c; T, S)$ is the unique solution of (3.3.12)
subject to the condition that F may be expressed in the series form

$$F(T, S) = \sum_{h=0}^{\infty}\sum_{\eta}\alpha_\eta\,\frac{C_\eta(T_1)\,C_\eta(S)}{C_\eta(I_p)}$$

where $F(0, 0) = 1$, i.e., $\alpha_{(0)} = 1$.

Proof --

Substitute F(T, S) in (3.3.12) and compare coefficients of $C_\eta(S)$
and then of $C_{\eta_i}(T_1)$ on both sides. This gives the following recurrence
relations for the $\alpha_\eta$'s

$$(h + 1)[c + k_i - \tfrac12(i-1)]\,\alpha_{\eta_i} = [a + k_i - \tfrac12(i-1)][b + k_i - \tfrac12(i-1)]\,\alpha_\eta.$$

Using the fact that $\alpha_{(0)} = 1$, these recurrence relations determine
$\alpha_\eta$ uniquely as

$$\alpha_\eta = \frac{(a)_\eta(b)_\eta}{(c)_\eta\,h!}.$$
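The closed form for $\alpha_\eta$ can be checked against the recurrence in exact rational arithmetic. The sketch below assumes the usual definition of the generalized hypergeometric coefficient, $(a)_\eta = \prod_i (a - \tfrac12(i-1))_{k_i}$ with $(x)_m = x(x+1)\cdots(x+m-1)$; the function names are introduced here for illustration and do not appear in the original.

```python
from fractions import Fraction
from math import factorial

def rising(x, m):
    """Ordinary rising factorial (x)_m = x (x+1) ... (x+m-1)."""
    out = Fraction(1)
    for j in range(m):
        out *= x + j
    return out

def gen_pochhammer(a, parts):
    """Generalized coefficient (a)_eta = prod_i (a - (i-1)/2)_{k_i}."""
    out = Fraction(1)
    for i, k_i in enumerate(parts, start=1):
        out *= rising(Fraction(a) - Fraction(i - 1, 2), k_i)
    return out

def alpha(a, b, c, parts):
    """Closed form alpha_eta = (a)_eta (b)_eta / ((c)_eta h!), h = sum of parts."""
    h = sum(parts)
    return (gen_pochhammer(a, parts) * gen_pochhammer(b, parts)
            / (gen_pochhammer(c, parts) * factorial(h)))

def recurrence_holds(a, b, c, parts, i):
    """Check (h+1)[c+k_i-(i-1)/2] alpha_{eta_i} = [a+k_i-(i-1)/2][b+k_i-(i-1)/2] alpha_eta,
    where eta_i is eta with its i-th part (1-based) increased by one."""
    bumped = list(parts)
    bumped[i - 1] += 1
    k_i, shift, h = parts[i - 1], Fraction(i - 1, 2), sum(parts)
    lhs = (h + 1) * (c + k_i - shift) * alpha(a, b, c, bumped)
    rhs = (a + k_i - shift) * (b + k_i - shift) * alpha(a, b, c, parts)
    return lhs == rhs
```

With, say, a = 5/2, b = 3, c = 7/2 and $\eta = (2, 1, 0)$, the recurrence holds exactly for i = 1, 2, 3, and the empty partition gives $\alpha_{(0)} = 1$.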

Corollary 3.3.2 --

The function ${}_1F_0(a; T, S)$ satisfies the partial differential equations

(3.3.20)  $\delta_S F - [a + \tfrac12(p-\ell)]\,\gamma_T F - \eta_T F = \tfrac12 a(p-1)\,\mathrm{Tr}(T_1)\,F$

and

(3.3.21)  $\delta_S F - \tfrac12(p-\ell)\,\varepsilon_S F - a\,\gamma_T F - \eta_T F = \tfrac12 a(\ell-1)\,\mathrm{Tr}(T_1)\,F$.

Proof --

(3.3.20) follows from (3.3.12) by putting $c = b = \tfrac12(p-1)$ and
(3.3.21) follows from (3.3.12) by putting $c = b = \tfrac12(\ell-1)$.

Theorem 3.3.2 --

The function ${}_1F_1(a; c; T, S)$ satisfies the partial differential
equation

(3.3.22)  $\delta_S F + [c - \tfrac12(p-1)]\,\varepsilon_S F - \gamma_T F = a\,\mathrm{Tr}(T_1)\,F$

and the function ${}_0F_1(c; T, S)$ satisfies the partial differential equation

(3.3.23)  $\delta_S F + [c - \tfrac12(p-1)]\,\varepsilon_S F = \mathrm{Tr}(T_1)\,F$.

Proof --

The theorem can be proved using a proof similar to the proof of
Theorem 3.3.1. Alternatively (3.3.22) follows from the differential
equation (3.3.12) for ${}_2F_1$ via the confluence

$$\lim_{b \to \infty} {}_2F_1(a, b; c; b^{-1}T, S) = {}_1F_1(a; c; T, S).$$

Similarly (3.3.23) follows from (3.3.22) and the relation

$$\lim_{a \to \infty} {}_1F_1(a; c; a^{-1}T, S) = {}_0F_1(c; T, S).$$

Corollary 3.3.3 --

The functions ${}_1F_1(a; c; T, S)$ and ${}_0F_1(c; T, S)$ are the unique solutions
of (3.3.22) and (3.3.23) subject to the condition that F may be expressed
in the form

$$F(T, S) = \sum_{h=0}^{\infty}\sum_{\eta}\alpha_\eta\,\frac{C_\eta(T_1)\,C_\eta(S)}{C_\eta(I_p)}$$

where $F(0, 0) = 1$.

Proof --

Follows from Theorem 3.3.2 exactly as Corollary 3.3.1 follows from
Theorem 3.3.1.

Corollary 3.3.4 --

The function ${}_0F_0(T, S)$ satisfies the partial differential equation

(3.3.24)  $\delta_S F - \gamma_T F = \tfrac12(p-1)\,\mathrm{Tr}(T_1)\,F$.

Proof --

Put $a = c = \tfrac12(p-1)$ in (3.3.22).


3.4.  The Term of Order $n^{-1}$ in the Asymptotic Expansion of $_2F_1(n,P^2,R^2)$.

As indicated in Chapter 2, Section 4, further terms in an asymptotic
expansion of $_2F_1(n,P^2,R^2)$ could be obtained, at least in principle, by a more
detailed analysis of the integrals in Section 3.2. Such an analysis would
lead to an asymptotic expansion of the form

$$_2F_1(n,P^2,R^2) \sim \varphi\, G$$

where $\varphi$ is the asymptotic representation for $_2F_1(n,P^2,R^2)$ given by
Theorem 3.2.1,

$$G = 1 + \frac{P_1}{n} + \frac{P_2}{n^2} + \cdots$$

and the $P_i$'s are functions of the $\rho_i^2$'s and the $r_i^2$'s but do not depend
on n.

In this section the partial differential equation satisfied by
$_2F_1(n,P^2,R^2)$ will be used to compute $P_1$.

If S is a p x p symmetric matrix with latent roots $s_1,\ldots,s_p$,
$T = \mathrm{diag}(T_1, 0)$ is a p x p matrix, and $T_1$ is a k x k symmetric
matrix with latent roots $t_1, t_2, \ldots, t_k$, then by Theorem 3.3.1
$_2F_1(a,b;c;T,S)$ satisfies the following partial differential equation


P 

(3.4.1  ) Z s SJL.  + 
i-i  ds. 


P P Si  a*i  P 

s S 8 — fr-  + £c-i(p-i )}  T 

i-1  J-1  5i  sj  i i-1  ~i 

Ui 


w 

ds  . 


(a+b-k+2 ) Zt.  f--  Zt  ' - V Z r - ■ r 

i-1  1 *1  i-1  1 dt.2  i-1  j-1  \ ' *j  *i 

i/j 


- ab  F T.  t.  . 
i-i 

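Let $S = R^2$, $T = P^2$, $a = b = \tfrac{1}{2}n$, and $c = \tfrac{1}{2}q$. Then

$$\frac{\partial F}{\partial s_i} = \frac{1}{2r_i}\frac{\partial F}{\partial r_i}, \qquad \frac{\partial^2 F}{\partial s_i^2} = \frac{1}{4r_i^2}\frac{\partial^2 F}{\partial r_i^2} - \frac{1}{4r_i^3}\frac{\partial F}{\partial r_i}, \qquad i = 1,\ldots,p$$

and

$$\frac{\partial F}{\partial t_i} = \frac{1}{2\rho_i}\frac{\partial F}{\partial \rho_i}, \qquad \frac{\partial^2 F}{\partial t_i^2} = \frac{1}{4\rho_i^2}\frac{\partial^2 F}{\partial \rho_i^2} - \frac{1}{4\rho_i^3}\frac{\partial F}{\partial \rho_i}, \qquad i = 1,\ldots,k .$$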
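The derivative identities used in this change of variables follow from the chain rule. The short derivation below is a sketch for the $s_i = r_i^2$ case only, assuming $r_i > 0$; the $t_i = \rho_i^2$ case is identical with $\rho_i$ in place of $r_i$.

```latex
\begin{aligned}
s_i = r_i^2 \;&\Longrightarrow\; \frac{\partial r_i}{\partial s_i} = \frac{1}{2r_i}, \\
\frac{\partial F}{\partial s_i} &= \frac{1}{2r_i}\,\frac{\partial F}{\partial r_i}, \\
\frac{\partial^2 F}{\partial s_i^2}
  &= \frac{1}{2r_i}\,\frac{\partial}{\partial r_i}\!\left(\frac{1}{2r_i}\,\frac{\partial F}{\partial r_i}\right)
   = \frac{1}{4r_i^2}\,\frac{\partial^2 F}{\partial r_i^2} - \frac{1}{4r_i^3}\,\frac{\partial F}{\partial r_i}.
\end{aligned}
```

As a consistency check, $4 s_i\,\partial^2 F/\partial s_i^2 + 4\{c - \tfrac12(p-1)\}\,\partial F/\partial s_i$ with $c = \tfrac12 q$ becomes $\partial^2 F/\partial r_i^2 + (q-p)\,r_i^{-1}\,\partial F/\partial r_i$, which agrees with the first two groups of terms in (3.4.2).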


Substituting the above in (3.4.1) it follows after some simplification
that $_2F_1(\tfrac{1}{2}n, \tfrac{1}{2}n; \tfrac{1}{2}q; P^2, R^2)$ satisfies
dF


p d2F  p , «.  P P r 

(3.4.2)  Z ~r  + (q-p)  Z ~-+2E  Z — 

1.1  dr.2  1.1  rl  *1  1.1  3.1  r,2  - r.2  «1 

1 U3  1 •> 


3 IF  ^4  d^F 

- (2n+3-2k).s/i'  Jr'  .Vi  7^ 

1=1  l 1=1  dp. 


k k p. 
- 2 Z 2 


dF  „ 2 

. 2 ? dp.  " n F “ pi  ' 

i=l  j = l p - p pi  i=l 


i/j 

The partial differential equation satisfied by G is obtained by
substituting $\varphi G$ for F in (3.4.2). After a considerable amount of
computation it follows that G satisfies


P d^O  ^ P i ip 

(3.4.3)  2 ^ + (2n-P-q+l)  Z ♦ (q-P)  S jr§- 

i-1  dr,  i-i  ripi  i-k+l  i Jri 


P P 

+ 2 Z Z 


dG  y 4 d2G 
- 1 P. 


i«k+i  .jok+i  r.^  - r ri  i«i  1 do. 
i*j  1 J 1 

+ (q+p-2n-3  ) j - — -|2-  + 2 z — - — - — 

i-l  ' ripi  9pi  i.i  1 * ri°i  dpi 


l k 2 l l k .2  k k ri 

- r S p ‘ ♦ — (p—q ) (~(p— q ) + l } Z r ? - Z Z 7 

l-i  i-i  1 i-i  .1-1  (r/-r.2)2 


i 2 2 

k p r . + r . 

. y y 1 

2 2 2 J 2 2 2 ' 
i-l  J-1  (P.  -P,  ) i-i  j-k+l  (r.  -r  ) 

i/j  1 0 1 J 


i J 


ii  u 2 

k k p p 

+ Z Z 1 — iL- 


The differential equation for $P_1$ is obtained by substituting

$$1 + \frac{P_1}{n} + \frac{P_2}{n^2} + \cdots$$

for G in (3.4.3) and equating coefficients of powers of n. The
resulting equation for $P_1$ is

(3.4.4)


k 

2 Z 


P.  & k p ' 

2 Z 


4P.  i 5 2 

IT  S P< 


. , l - r.p.  dr.  . , l - r.p.  dp.  4 . , i 

1=1  li  l 1=1  li  l i=l 

j k k k r 2 

+ r(p-q)(p-q+2)  Zr  " - z z — ~ — rr 

i-i  1 i=i  j-i  (r  Z-r  Z ) 


, , 4 2 

k k p.  p. 

+ Z Z — — J- 


i/j 

k p 

- Z Z 


i J 


2 2 
r.  + r . 
i J 


222  “ " 222 

1=1  J-i  (p.  -o.  ) i=i  j=k+i  (r  -r  ) 

Ui  1 J J 


Also $P_1$ satisfies the boundary condition

(3.4.5)  $P_1(P^2,R^2) = P_1(R^2,P^2)$

since $_2F_1(n,P^2,R^2) = {}_2F_1(n,R^2,P^2)$ and $\varphi(n,P^2,R^2) = \varphi(n,R^2,P^2)$.

The general solution of (3.4.4) is

$$P_1 = Q + Y(u_1,\ldots,u_{p+k-1})$$

where Q is any particular solution, Y is an arbitrary function, and
the $u_i$ are any p + k - 1 independent solutions of the system of
equations


1 - °iri 
2 0, 


dr. 


1 ' Vk  . "‘W  , 

“5a drk  " - -T~  dpi 


2p. 


1 


-"‘W 


lpk 


Such  solutions  are  easily  found  to  be 


u. 

l 


1 - Vi 


i=l, . . .,k 


(3.4.6) 


Vi 


1 - Vi 


l - 


°krk 


Pi 


Vi+i  - ri 


i-1 , . . k-l 
i=k+l,  . . .,p 


We apply the boundary conditions to evaluate Y. If we can find a
Q which satisfies the boundary conditions then so must Y. Examination
of (3.4.6) shows that a function satisfying (3.4.5) must be identically
constant. Since there is only one boundary condition for $P_1$ there is
no way to determine the constant.

A particular  solution  of  (3.4.4)  is 


(3.4.7) 


Note  that 
is 


Q - •§•  v rip.  - i(q-p)(q-p-2  ) £ (r.p.)'1 
i=.i  * i«l 


. i * |1-rihlll-rihllripi<ripil 
2 ..  .2  2 , , 2 2 
(r.  lloj  -Oj  ) 


+ — V 
2 . 


(i  -r . p.  )r. 

li  l 


i»l  j-k+l  (r.'-r^  )pi 

Note that $Q(P^2,R^2) = Q(R^2,P^2)$. Hence the general solution of (3.4.4) is

$$P_1 = Q + \eta$$

where Q is given by (3.4.7) and $\eta$ is a constant.

Additional $P_i$'s could be calculated by the same technique but the
partial differential equations for $P_i$ are very complicated when $i \ge 2$.

CHAPTER 4

CANONICAL CORRELATION COEFFICIENTS

4.1.  Introduction.

Let $r_1^2 \ge r_2^2 \ge \cdots \ge r_p^2$ be the squared sample canonical correlation
coefficients between variates $x_1,\ldots,x_p$ and a second set of q variates $(p \le q)$,
calculated from a sample of N = n + 1 observations from a (p+q)-variate
normal distribution. The exact joint density function of the $r_i^2$'s is
given by (1.3.1) as

(4.1.1)  $$f(r_1^2,\ldots,r_p^2) = C_1 \prod_{i=1}^{p} (1-\rho_i^2)^{\frac{1}{2}n}\; {}_2F_1(\tfrac{1}{2}n, \tfrac{1}{2}n; \tfrac{1}{2}q; P^2, R^2) \prod_{i=1}^{p} \left\{ (r_i^2)^{\frac{1}{2}(q-p-1)} (1-r_i^2)^{\frac{1}{2}(n-p-q-1)} \right\} \prod_{i<j} (r_i^2 - r_j^2) \prod_{i=1}^{p} dr_i^2$$

where $1 > \rho_1 \ge \rho_2 \ge \cdots \ge \rho_p \ge 0$ are the population canonical correlation
coefficients, $R = \mathrm{diag}(r_1,\ldots,r_p)$, $P = \mathrm{diag}(\rho_1,\ldots,\rho_p)$, and

$$C_1 = \frac{\pi^{\frac{1}{2}p^2}\, \Gamma_p(\tfrac{1}{2}n)}{\Gamma_p\{\tfrac{1}{2}(n-q)\}\, \Gamma_p(\tfrac{1}{2}q)\, \Gamma_p(\tfrac{1}{2}p)} .$$

The distribution of the $r_i^2$'s depends only on the $\rho_i^2$'s and that part of
the pdf which involves the $\rho_i$'s is the marginal likelihood L, i.e.,

(4.1.2)  $$L = \prod_{i=1}^{p} (1-\rho_i^2)^{\frac{1}{2}n}\; {}_2F_1(\tfrac{1}{2}n, \tfrac{1}{2}n; \tfrac{1}{2}q; P^2, R^2) .$$

The results of Chapter 3 are used to derive an asymptotic expansion
for large n of the pdf (4.1.1) and an asymptotic representation for
large n of the likelihood function (4.1.2) under the assumption that
the population coefficients satisfy

(4.1.3)  $1 > \rho_1 > \rho_2 > \cdots > \rho_k > \rho_{k+1} = \cdots = \rho_p = 0 .$

These results are then used to study the Bartlett-Lawley tests that the
p - k residual population coefficients are zero and to estimate the
population coefficients.


4.2.  Asymptotic Expansions.

An asymptotic expansion of the joint pdf of $r_1^2,\ldots,r_p^2$ is obtained
by substituting the asymptotic expansion of $_2F_1(\tfrac{1}{2}n, \tfrac{1}{2}n; \tfrac{1}{2}q; P^2, R^2)$, given
by Theorem 3.2.1 and (3.4.7), into (4.1.1). The result is summarized in
the following theorem.

Theorem 4.2.1--

An asymptotic expansion for large n of the joint pdf of $r_1^2,\ldots,r_p^2$,
the squared sample canonical correlation coefficients, when the population
coefficients satisfy (4.1.3), is

(4.2.1)  $$\varphi\left\{ 1 + \frac{P_1}{n} + O(n^{-2}) \right\}$$

where


(4.2.2)  cp  - C n (o-r  n )-n+i(p+q-1)(  2)(q-p-2)/4(i.r ^^(n-p-q-1), 


* Ui 

kjr.2-r.2^k  p \ 

n \ -i- i-  n n (r.  -r.:  ) 

KJ-  P.2  - P.2  ' i~i  j-k+i  1 J 

J 


n (lr  p„  , 

iuk+i  k+1  1 " 


(in)n»-^(P*q)e-nk>  rife, rife, rife, 

c - £ 5 

f rv(fcj)2r  (fejr  (fcjr  (i(n-qj) 

A P P P 


n in- p12,fcp1k-ilp*<1)l  , 
ri  “ S Vi  "


1 A (l-ripi)(i-rjpj)(riVpJpJ) 


+ 7 Z s 


2 2 2 2 
i<J  (ri  "rj  )(pi  "pj  5 


k p (l-rp)r 

’ r 2 2 -- - h~  * r'  • 

i«*l  j-k+i  (i\  -r  )pi 
$\eta$ is a constant, and $O(n^{-2})$ are terms which for fixed $\rho_i$'s and $r_i$'s are
$O(n^{-2})$ as $n \to \infty$. Furthermore, for every $\epsilon > 0$ let $B(\epsilon)$ be the set

(4.2.3)  $$B(\epsilon) = \{ (r_1^2,\ldots,r_p^2, \rho_1^2,\ldots,\rho_k^2) : r_i^2 - r_{i+1}^2 > \epsilon \;(0 \le i \le p) \text{ and } \rho_j^2 - \rho_{j+1}^2 > \epsilon \;(0 \le j \le k) \}$$

where $r_0^2 = \rho_0^2 = 1$ and $r_{p+1}^2 = \rho_{k+1}^2 = 0$.
Then $f(R^2) \sim \varphi$ uniformly on $B(\epsilon)$ for every $\epsilon > 0$.

The function $\varphi$ is called the "asymptotic pdf" or "asymptotic
distribution" of $r_1^2,\ldots,r_p^2$.

The following corollaries follow easily from Theorem 4.2.1.

Corollary 4.2.1--

An asymptotic representation for large n of the marginal likelihood
function L when the $\rho_i$'s satisfy (4.1.3), is


L - C n {( 

i-i 


p n 2-0 

i<J  J 


2 2 . -h 

j ’ 


where $C_3$ is a constant which may depend on n and $r_1^2,\ldots,r_p^2$.
Furthermore, L is asymptotically equivalent to this representation uniformly on
$B(\epsilon)$ for every $\epsilon > 0$, where $B(\epsilon)$ is defined by (4.2.3).

Corollary 4.2.2--

The largest k coefficients $r_1^2,\ldots,r_k^2$ are asymptotically
sufficient for the population coefficients $\rho_1^2,\ldots,\rho_k^2$.

Corollary 4.2.3--

The asymptotic conditional pdf of the smallest p - k sample
coefficients $r_{k+1}^2,\ldots,r_p^2$ given the largest k coefficients
$r_1^2,\ldots,r_k^2$ is

(4.2.4)  $$\varphi_c = C_4 \prod_{i=1}^{k} \prod_{j=k+1}^{p} (r_i^2 - r_j^2)^{\frac{1}{2}} \prod_{i=k+1}^{p} \left\{ (r_i^2)^{\frac{1}{2}(q-p-1)} (1-r_i^2)^{\frac{1}{2}(n-q-p-1)} \right\} \prod_{k+1 \le i<j \le p} (r_i^2 - r_j^2)$$

where $C_4$ is a constant.

Furthermore, if $f(r_{k+1}^2,\ldots,r_p^2 \mid r_1^2,\ldots,r_k^2)$ is the true conditional
distribution of $r_{k+1}^2,\ldots,r_p^2$ given $r_1^2,\ldots,r_k^2$, then
$f(r_{k+1}^2,\ldots,r_p^2 \mid r_1^2,\ldots,r_k^2) \sim \varphi_c$ uniformly on $B(\epsilon)$ for every $\epsilon > 0$,
where $B(\epsilon)$ is defined by (4.2.3).


The asymptotic pdf $\varphi$, defined by (4.2.2), contains "beta-like"
factors linked by factors of the form $r_i^2 - r_j^2$. By making the
transformation of variables suggested by Hsu (1941b) it is possible to obtain
a "normal-type" approximation which no longer contains linkage factors
corresponding to distinct population coefficients. Hsu let

(4.2.5)  $$z_i = \frac{\sqrt{n}\,(r_i^2 - \rho_i^2)}{2\rho_i(1-\rho_i^2)} \qquad (1 \le i \le k)$$

and

(4.2.6)  $$z_j = n\,r_j^2 \qquad (k+1 \le j \le p) .$$
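To make the two scalings concrete, here is a minimal Python sketch of Hsu's transformation. It assumes (4.2.5) and (4.2.6) read $z_i = \sqrt{n}(r_i^2-\rho_i^2)/\{2\rho_i(1-\rho_i^2)\}$ for $i \le k$ and $z_j = n r_j^2$ for $j > k$; the function name `hsu_transform` and its argument layout are illustrative choices, not part of the original text.

```python
import math


def hsu_transform(r_sq, rho_sq, n):
    """Map squared sample coefficients to Hsu's z-variables.

    r_sq   : squared sample canonical correlations, largest first (length p)
    rho_sq : the k nonzero squared population coefficients, largest first
    n      : sample size minus one
    """
    k = len(rho_sq)
    z = []
    for i, r2 in enumerate(r_sq):
        if i < k:
            rho = math.sqrt(rho_sq[i])
            # (4.2.5): centre at rho_i^2, scale by the asymptotic s.d. of r_i^2
            z.append(math.sqrt(n) * (r2 - rho_sq[i])
                     / (2.0 * rho * (1.0 - rho_sq[i])))
        else:
            # (4.2.6): coefficients with zero population value are scaled by n
            z.append(n * r2)
    return z


# One nonzero population coefficient (rho^2 = 0.49) and one zero coefficient.
print(hsu_transform([0.50, 0.02], [0.49], n=100))
```

The first k outputs are on a root-n scale and the remaining p - k on an n scale, which is why the two groups have different limiting laws.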


Making this transformation of variables in (4.2.2) and simplifying gives

Corollary 4.2.4--

The asymptotic joint pdf of $z_1,\ldots,z_p$, when the $\rho_i$'s satisfy
(4.1.3), is

(4.2.7)


k P \ P i,n  _ . . P 

n n g(z  ) ' c exp  -Is  zi  n z 2 q p"  n (z -z  ) 

[ id  1 5 ' j-k+i  J j-k+i  J k+1  J 

i<j 


i k z (q-p-a+2(p-k)  + 7 p. 2 ) 

, * o r.  1 1 

i + n \ 2 


(l-3pi2)zi3 


id 


2pi 


+ y 


k Pid-Pi  )z  ; 

“ ■ + 0(n  ) 


2 2 
J-1  pi  - Pj 
j/i 


where $g(z_i)$ denotes the standard normal density and the constant in (4.2.7) is

$$\frac{\pi^{\frac{1}{2}(p-k)^2}}{2^{\frac{1}{2}(p-k)(q-k)}\, \Gamma_{p-k}\{\tfrac{1}{2}(q-k)\}\, \Gamma_{p-k}\{\tfrac{1}{2}(p-k)\}} .$$

Here $O(n^{-1})$ means terms which are $O(n^{-1})$ as $n \to \infty$ uniformly on any
bounded set of $z_i$'s.

The first line of (4.2.7) shows that asymptotically the $z_i$'s which
correspond to distinct nonzero $\rho_i$'s are standard normal, independent of
$z_j$ $(j \ne i)$. The $z_j$'s corresponding to zero population coefficients are
dependent and their asymptotic distribution is the same as the distribution
of the latent roots of a (p-k) x (p-k) matrix having the Wishart
distribution $W_{p-k}(q-k, I_{p-k})$.

Hsu (1941b) found the limiting distribution of the $z_i$'s when the
population coefficients have arbitrary multiplicity. His result reduces
to the first line of (4.2.7) when the nonzero $\rho_i$'s are distinct.

Sugiura (1976) found the limiting distribution of the $z_i$'s, up to and
including terms of order $n^{-1}$, when the population coefficients have
arbitrary multiplicity and are nonzero. If p = k then (4.2.7) gives
the leading term and term of order $n^{-1/2}$ in Sugiura's expansion.

Asymptotic moments (i.e., moments with respect to the asymptotic
distribution) of $r_i^2$, when $\rho_i^2 \ne 0$ is simple, can be obtained from (4.2.7).
In particular we have

(4.2.8)  $$E(r_i^2) = \rho_i^2 + \frac{(1-\rho_i^2)}{n} \left\{ q + p - 1 - 2\rho_i^2 + 2(1-\rho_i^2) \sum_{j=1,\, j \ne i}^{k} \frac{\rho_j^2}{\rho_i^2 - \rho_j^2} \right\} + O(n^{-2}) ,$$

(4.2.9)  $$E(r_i) = \rho_i + \frac{(1-\rho_i^2)}{2n\rho_i} \left\{ q + p - 2 - \rho_i^2 + 2(1-\rho_i^2) \sum_{j=1,\, j \ne i}^{k} \frac{\rho_j^2}{\rho_i^2 - \rho_j^2} \right\} + O(n^{-2}) ,$$

and

(4.2.10)  $$\mathrm{Var}(r_i) = \frac{(1-\rho_i^2)^2}{n} + O(n^{-2}) .$$

These results agree with the results obtained by Lawley (1959).
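The three moment formulas can be cross-checked against each other through $E(r_i^2) = \mathrm{Var}(r_i) + \{E(r_i)\}^2$. The sketch below assumes the readings $E(r_i) = \rho_i + \frac{1-\rho_i^2}{2n\rho_i}\{p+q-2-\rho_i^2+2(1-\rho_i^2)S_i\}$ and $\mathrm{Var}(r_i) = (1-\rho_i^2)^2/n$, where $S_i = \sum_{j \ne i} \rho_j^2/(\rho_i^2-\rho_j^2)$ is a shorthand introduced here only for this check.

```latex
\begin{aligned}
\{E(r_i)\}^2 &= \rho_i^2 + \frac{1-\rho_i^2}{n}\Bigl\{p+q-2-\rho_i^2 + 2(1-\rho_i^2)S_i\Bigr\} + O(n^{-2}), \\
\mathrm{Var}(r_i) &= \frac{1-\rho_i^2}{n}\,(1-\rho_i^2) + O(n^{-2}), \\
E(r_i^2) &= \rho_i^2 + \frac{1-\rho_i^2}{n}\Bigl\{p+q-1-2\rho_i^2 + 2(1-\rho_i^2)S_i\Bigr\} + O(n^{-2}).
\end{aligned}
```

For p = 1 and k = 1 the last line reduces to $\rho^2 + (1-\rho^2)(q - 2\rho^2)/n$, the familiar first-order mean of a squared multiple correlation coefficient.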


4.3.  Tests of Significance of the Residual Coefficients.

In this section we examine the Bartlett-Lawley tests of the null
hypothesis $H_k$ that the residual p - k population canonical correlation
coefficients are zero given that $\rho_1^2 > \rho_2^2 > \cdots > \rho_k^2 > 0$. The approach
followed here is similar to the one used by James (1969) in connection
with the Bartlett-Lawley tests for equality of the p - k smallest latent
roots of a covariance matrix.

Bartlett and Lawley studied tests of $H_k$ using test statistics of
the form $\beta T_k$, where

$$T_k = -\ln \prod_{i=k+1}^{p} (1 - r_i^2)$$

and $\beta$ is a correction factor which may depend on n and $r_1^2,\ldots,r_k^2$,
but does not depend on $r_{k+1}^2,\ldots,r_p^2$ (see Chapter 1, Section 3). It is
well known that $nT_k$ has an asymptotic $\chi^2$ distribution with
(p-k)(q-k) degrees of freedom. The values of $\beta$ were chosen to improve
the rate of convergence of the test statistic $\beta T_k$ to its asymptotic $\chi^2$
distribution by improving the agreement between the moments of $\beta T_k$ and
the corresponding moments of $\chi^2_{(p-k)(q-k)}$. Lawley (1959) suggested the
use of the statistic

$$L_k = \Bigl\{ n - k - \tfrac{1}{2}(p+q+1) + \sum_{i=1}^{k} r_i^{-2} \Bigr\} T_k .$$

The $\rho_i^2$ $(1 \le i \le k)$ are nuisance parameters in these tests. Since
$r_1^2,\ldots,r_k^2$ are asymptotically sufficient for $\rho_1^2,\ldots,\rho_k^2$ (Corollary
4.2.2), the effect of the $\rho_i^2$'s can be eliminated, at least asymptotically,
by using the asymptotic conditional distribution of $r_{k+1}^2,\ldots,r_p^2$ given
$r_1^2,\ldots,r_k^2$. The technique used here is to find the appropriate factor
$\beta$ by using the conditional density $\varphi_c$ given by (4.2.4) to compute the
mean and variance of $T_k$. The result confirms Lawley's choice of $\beta$
and provides some information on the accuracy of the approximation.

If k = 0 then $H_k$ says that all the $\rho_i$'s are zero. In this case
$\varphi_c$ is the exact null density of $r_1^2,\ldots,r_p^2$ where the linkage factor

$$\prod_{i=1}^{k} \prod_{j=k+1}^{p} (r_i^2 - r_j^2)^{\frac{1}{2}}$$

is taken to be unity. Bartlett (1938) proved that in this case

$$-\{ n - \tfrac{1}{2}(p+q+1) \} \ln \prod_{j=1}^{p} (1 - r_j^2)$$

is asymptotically $\chi^2$ on pq degrees of freedom.

Since $r_i^2 = \rho_i^2 + O_p(n^{-1/2})$ $(1 \le i \le k)$ and $r_j^2 = O_p(n^{-1})$ $(k+1 \le j \le p)$
(see (4.2.5) - (4.2.7)), $r_i^2 / r_j^2$ is large for large n with probability
arbitrarily close to 1. Thus when testing that the last p - k population
coefficients are zero when $k \ne 0$, we could, as a first approximation,
ignore the linkage factor

$$\prod_{i=1}^{k} \prod_{j=k+1}^{p} (r_i^2 - r_j^2)^{\frac{1}{2}} .$$

The resulting distribution is approximately the null distribution of the
squared sample canonical correlation coefficients with n, p, and q
replaced by n' = n - 2k, p' = p - k, and q' = q - k, and suggests
the modified statistic

$$-\{ n' - \tfrac{1}{2}(p'+q'+1) \} \ln \prod_{j=k+1}^{p} (1 - r_j^2) = -\{ n - k - \tfrac{1}{2}(p+q+1) \} \ln \prod_{j=k+1}^{p} (1 - r_j^2)$$

which is asymptotically $\chi^2_{(p-k)(q-k)}$. The multiplying factor is not
quite the same as the one suggested by Bartlett (1947) who replaced n
by n - k instead of n - 2k. Lawley's (1959) correction factor arises
by taking into account the linkage factor.
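The arithmetic behind the two competing multipliers is short enough to write out. The lines below are a worked check, assuming only the substitutions n' = n - 2k, p' = p - k, q' = q - k stated in the text.

```latex
\begin{aligned}
n' - \tfrac12(p'+q'+1) &= (n-2k) - \tfrac12(p+q-2k+1) = n - k - \tfrac12(p+q+1), \\
(n-k) - \tfrac12(p'+q'+1) &= (n-k) - \tfrac12(p+q-2k+1) = n - \tfrac12(p+q+1).
\end{aligned}
```

The first line is the multiplier obtained by ignoring the linkage factor; the second is what results when n is replaced by n - k rather than n - 2k. The two differ by exactly k.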

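Let $E_c$ denote expectation with respect to the asymptotic conditional
distribution (4.2.4). Then

(4.3.1)  $$E_c(T_k) = -\frac{\partial}{\partial h} E_c\Bigl\{ \prod_{i=k+1}^{p} (1-r_i^2)^h \Bigr\} \Big|_{h=0}$$

and

(4.3.2)  $$E_c(T_k^2) = \frac{\partial^2}{\partial h^2} E_c\Bigl\{ \prod_{i=k+1}^{p} (1-r_i^2)^h \Bigr\} \Big|_{h=0} .$$

We can interchange the order of differentiation and integration in (4.3.1)
and (4.3.2) because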


X r. 


exists  and  in  a neighborhood  of  h 


i - a (l-r,  ) ^h'1  < 2 S r.2 

4 < bil  ^ 

i a . • 

(see, for example, Burrill (1972), p. 119). We will actually compute

$$E_c\Bigl\{ \prod_{i=k+1}^{p} (1-r_i^2)^h \Bigr\}$$

and then use (4.3.1) and (4.3.2) to calculate $E_c(T_k)$ and $E_c(T_k^2)$.

It follows from Theorem 4.2.1 that, for large n, the true conditional
distribution is

(4.3.3)


1 k P r . ( 1 -r . p.  ; 

1 + - Z Z — r~~ 

i-i  j-k-M  p.  (r.  -r  ' ) 
i i 3 


0(n'2)  > 


where $\varphi_c$ is the asymptotic conditional pdf defined by (4.2.4). The
error in the conditional pdf is $O(n^{-1})$, but since the $r_i^2$'s $(k+1 \le i \le p)$
vary only by $O(n^{-1})$ with probability arbitrarily close to 1, it can be
shown, by essentially the same argument that we will use to compute

$$E_c\Bigl\{ \prod_{i=k+1}^{p} (1-r_i^2)^h \Bigr\} ,$$

that the relative error in this expectation is $O(n^{-2})$.
Expand the linkage factor in a Taylor series

(4.3.4)  $$\prod_{i=1}^{k} \prod_{j=k+1}^{p} (r_i^2 - r_j^2)^{\frac{1}{2}} = \prod_{i=1}^{k} \prod_{j=k+1}^{p} \bigl\{ r_i^2 (1 - r_j^2/r_i^2) \bigr\}^{\frac{1}{2}} = \prod_{i=1}^{k} \prod_{j=k+1}^{p} r_i \Bigl\{ 1 - \frac{r_j^2}{2r_i^2} + O(r_j^4) \Bigr\} = \prod_{i=1}^{k} r_i^{\,p-k} \Bigl\{ 1 - \tfrac{1}{2}\alpha \sum_{j=k+1}^{p} r_j^2 + O(r_j^4) \Bigr\}$$

where

$$\alpha = \sum_{i=1}^{k} r_i^{-2}$$

and $O(r_j^4)$ means terms which are at least fourth order in the
$r_j$ $(k+1 \le j \le p)$.


Let

(4.3.5)  $$f_0 = C_6 \prod_{i=k+1}^{p} \left\{ (r_i^2)^{\frac{1}{2}(q-p-1)} (1-r_i^2)^{\frac{1}{2}(n-q-p-1)} \right\} \prod_{k+1 \le i<j \le p} (r_i^2 - r_j^2)$$

where $1 \ge r_{k+1}^2 \ge \cdots \ge r_p^2 \ge 0$ and

$$C_6 = \frac{\pi^{\frac{1}{2}(p-k)^2}\, \Gamma_{p-k}\{\tfrac{1}{2}(n-2k)\}}{\Gamma_{p-k}\{\tfrac{1}{2}(p-k)\}\, \Gamma_{p-k}\{\tfrac{1}{2}(n-q-k)\}\, \Gamma_{p-k}\{\tfrac{1}{2}(q-k)\}} .$$

$f_0$ is the exact null distribution of the squared canonical correlation
coefficients, $r_{k+1}^2,\ldots,r_p^2$, between two sets of variates $w_1,\ldots,w_{p-k}$
and $z_1,\ldots,z_{q-k}$, calculated from a sample of size n - 2k + 1 observations
from a (p+q-2k)-variate normal distribution. $E_N$ will denote
expectation with respect to $f_0$. Note that $f_0$ is obtained from $\varphi_c$
by ignoring the linkage factor, expanding the domain of $r_{k+1}^2,\ldots,r_p^2$
from $r_k^2 \ge r_{k+1}^2 \ge \cdots \ge r_p^2 \ge 0$ to $1 \ge r_{k+1}^2 \ge \cdots \ge r_p^2 \ge 0$, and
adjusting the constant. It follows from (4.3.4) and (4.3.5) that


(4.3.6)  $$E_c\Bigl\{ \prod_{i=k+1}^{p} (1-r_i^2)^h \Bigr\} = C_7\, E_N\Bigl\{ \prod_{i=k+1}^{p} (1-r_i^2)^h \Bigl[ 1 - \tfrac{1}{2}\alpha \sum_{j=k+1}^{p} r_j^2 + O(r_j^4) \Bigr] \Bigr\} - \int_D \varphi_c \prod_{i=k+1}^{p} (1-r_i^2)^h$$

where $D = \{ (r_{k+1}^2,\ldots,r_p^2) : 1 \ge r_{k+1}^2 > r_k^2 ,\; r_{k+1}^2 \ge \cdots \ge r_p^2 \ge 0 \}$
and

$$C_7 = C_4\, C_6^{-1} \prod_{i=1}^{k} r_i^{\,p-k} .$$

We will compute the expectation with respect to the null distribution which
occurs in (4.3.6) and then show that the integral in (4.3.6) is of lower
order of magnitude. We need the following lemma.

Lemma 4.3.1--

(i)  $$E_N\Bigl\{ \Bigl( \sum_{j=k+1}^{p} r_j^2 \Bigr) \prod_{i=k+1}^{p} (1-r_i^2)^h \Bigr\} = \frac{(p-k)(q-k)}{n - 2k + 2h}\, E_0(h)$$

and

(ii)  $$E_N\Bigl\{ \Bigl( \sum_{j=k+1}^{p} r_j^2 \Bigr)^2 \prod_{i=k+1}^{p} (1-r_i^2)^h \Bigr\} = \frac{A\, E_0(h)}{(n+2h-2k-1)(n+2h-2k)(n+2h-2k+2)}$$

where

$$A = (p-k)(q-k)\bigl[ (n+2h)\{(p-k)(q-k)+2\} + (p-k)(q-k)(1-2k) - 2p - 2q \bigr] ,$$

and

$$E_0(h) = E_N\Bigl\{ \prod_{i=k+1}^{p} (1-r_i^2)^h \Bigr\} .$$

Proof--

The proof uses some properties of multivariate Beta distributions
which can be found in Kshirsagar (1972).

Let $u_i = 1 - r_{k+i}^2$ $(1 \le i \le p-k)$. The null distribution of $u_1,\ldots,u_{p-k}$
is the same as the distribution of the latent roots of a (p-k) x (p-k)
matrix $U = (u_{ij})$ having a multivariate Beta$\{\tfrac{1}{2}(n-k-q), \tfrac{1}{2}(q-k)\}$
distribution.

(i) Note that

$$\sum_{i=k+1}^{p} r_i^2 = \mathrm{tr}(I - U) .$$

Since the diagonal elements of I - U all have the same expectation we
need only find the expectation of the first element $1 - u_{11}$ and
multiply the result by (p-k).

Put U = TT' where T is a lower triangular (p-k) x (p-k) matrix.
Then $t_{11}^2,\ldots,t_{p-k,p-k}^2$ are independent, $t_{ii}^2$ has a
Beta$\{\tfrac{1}{2}(n-k-q-i+1), \tfrac{1}{2}(q-k)\}$ distribution, and $1 - u_{11} = 1 - t_{11}^2$.
Hence

$$E_N\Bigl\{ \Bigl( \sum_{i=k+1}^{p} r_i^2 \Bigr) \prod_{i=k+1}^{p} (1-r_i^2)^h \Bigr\} = (p-k)\, E_N\Bigl\{ (1 - t_{11}^2) \prod_{i=1}^{p-k} t_{ii}^{2h} \Bigr\} = (p-k) \Bigl[ E_N\Bigl\{ \prod_{i=1}^{p-k} t_{ii}^{2h} \Bigr\} - E_N\Bigl\{ t_{11}^2 \prod_{i=1}^{p-k} t_{ii}^{2h} \Bigr\} \Bigr] .$$

It is easily shown that

$$E_N\Bigl\{ t_{11}^2 \prod_{i=1}^{p-k} t_{ii}^{2h} \Bigr\} = \frac{n - k - q + 2h}{n - 2k + 2h}\, E_0(h)$$

so that

$$E_N\Bigl\{ (1 - t_{11}^2) \prod_{i=1}^{p-k} t_{ii}^{2h} \Bigr\} = \frac{q - k}{n - 2k + 2h}\, E_0(h)$$

proving (i).

(ii) Note that

$$\Bigl( \sum_{i=k+1}^{p} r_i^2 \Bigr)^2 = \{ \mathrm{tr}(I - U) \}^2 .$$

It follows from the fact that the diagonal elements of U are identically
distributed that

$$E_N\Bigl\{ \Bigl( \sum_{i=k+1}^{p} r_i^2 \Bigr)^2 \prod_{i=k+1}^{p} (1-r_i^2)^h \Bigr\} = E_N\Bigl\{ \Bigl( \sum_{i=1}^{p-k} (1 - u_{ii}) \Bigr)^2 |U|^h \Bigr\}$$

$$= E_N\Bigl[ \bigl\{ (p-k)^2 - 2(p-k)^2 u_{11} + (p-k) u_{11}^2 + (p-k)(p-k-1) u_{11} u_{22} \bigr\} |U|^h \Bigr]$$

$$= (p-k)^2 E_N\Bigl\{ \prod_{i=1}^{p-k} t_{ii}^{2h} \Bigr\} - 2(p-k)^2 E_N\Bigl\{ t_{11}^2 \prod_{i=1}^{p-k} t_{ii}^{2h} \Bigr\} + (p-k) E_N\Bigl\{ t_{11}^4 \prod_{i=1}^{p-k} t_{ii}^{2h} \Bigr\} + (p-k)(p-k-1) E_N\Bigl\{ t_{11}^2 (t_{21}^2 + t_{22}^2) \prod_{i=1}^{p-k} t_{ii}^{2h} \Bigr\} .$$

The first three expectations follow, as in the proof of (i), from the
definition of $E_0(h)$ and the fact that the $t_{ii}^2$'s have independent Beta
distributions. The results are

(4.3.7)  $$(p-k)^2 E_N\Bigl\{ \prod_{i=1}^{p-k} t_{ii}^{2h} \Bigr\} = (p-k)^2 E_0(h) ,$$

(4.3.8)  $$-2(p-k)^2 E_N\Bigl\{ t_{11}^2 \prod_{i=1}^{p-k} t_{ii}^{2h} \Bigr\} = -\frac{2(p-k)^2 (n-k-q+2h)}{n-2k+2h}\, E_0(h) ,$$

and

(4.3.9)  $$(p-k) E_N\Bigl\{ t_{11}^4 \prod_{i=1}^{p-k} t_{ii}^{2h} \Bigr\} = \frac{(p-k)(n-k-q+2h+2)(n-k-q+2h)}{(n-2k+2h+2)(n-2k+2h)}\, E_0(h) .$$

The last expectation is

$$(p-k)(p-k-1) \Bigl[ E_N\Bigl\{ t_{11}^2 t_{22}^2 \prod_{i=1}^{p-k} t_{ii}^{2h} \Bigr\} + E_N\Bigl\{ t_{11}^2 t_{21}^2 \prod_{i=1}^{p-k} t_{ii}^{2h} \Bigr\} \Bigr] .$$

By the same reasoning as in (i) we get

(4.3.10)  $$(p-k)(p-k-1) E_N\Bigl\{ t_{11}^2 t_{22}^2 \prod_{i=1}^{p-k} t_{ii}^{2h} \Bigr\} = \frac{(p-k)(p-k-1)(n-k-q+2h)(n-k-q+2h-1)}{(n-2k+2h)(n-2k+2h-1)}\, E_0(h) .$$

To compute $E_N\{ t_{11}^2 t_{21}^2 \prod_{i=1}^{p-k} t_{ii}^{2h} \}$ partition T as

$$T = \begin{pmatrix} T_{11} & 0 \\ T_{21} & T_{22} \end{pmatrix}$$

where $T_{11}$ is 2 x 2, $T_{22}$ is (p-k-2) x (p-k-2), $T_{11}$ and $T_{22}$ are lower triangular and

$$T_{11} = \begin{pmatrix} t_{11} & 0 \\ t_{21} & t_{22} \end{pmatrix} .$$

$T_{11}$ and $T_{22}$ are independent. $T_{11}T_{11}'$ has a multivariate
Beta$\{\tfrac{1}{2}(n-k-q), \tfrac{1}{2}(q-k)\}$ distribution and $T_{22}T_{22}'$ has a multivariate
Beta$\{\tfrac{1}{2}(n-k-q-2), \tfrac{1}{2}(q-k)\}$ distribution. It follows that
$T_{11}, t_{33}^2, \ldots, t_{p-k,p-k}^2$ are independent. Let

$$w = \frac{t_{21}}{\{ (1-t_{11}^2)(1-t_{22}^2) \}^{\frac{1}{2}}} .$$

Then $t_{11}^2$, $t_{22}^2$, and $w^2$ are independent, $t_{21}^2 = w^2 (1-t_{11}^2)(1-t_{22}^2)$,
and $t_{11}^2$, $t_{22}^2$, and $w^2$ are distributed as Beta$\{\tfrac{1}{2}(n-q-k), \tfrac{1}{2}(q-k)\}$,
Beta$\{\tfrac{1}{2}(n-q-k-1), \tfrac{1}{2}(q-k)\}$, and Beta$\{\tfrac{1}{2}, \tfrac{1}{2}(q-k-1)\}$, respectively.
Therefore

$$E_N\Bigl\{ t_{11}^2 t_{21}^2 \prod_{i=1}^{p-k} t_{ii}^{2h} \Bigr\} = E_N(w^2)\, E_N\Bigl\{ t_{11}^2 (1-t_{11}^2)(1-t_{22}^2) \prod_{i=1}^{p-k} t_{ii}^{2h} \Bigr\} .$$

After some simplification we have

(4.3.11)  $$(p-k)(p-k-1) E_N\Bigl\{ t_{11}^2 t_{21}^2 \prod_{i=1}^{p-k} t_{ii}^{2h} \Bigr\} = \frac{(n+2h-q-k)(q-k)(p-k)(p-k-1)}{(n+2h-2k+2)(n+2h-2k-1)(n+2h-2k)}\, E_0(h) .$$

Adding (4.3.7) - (4.3.11) we have

$$E_N\Bigl\{ \Bigl( \sum_{j=k+1}^{p} r_j^2 \Bigr)^2 \prod_{i=k+1}^{p} (1-r_i^2)^h \Bigr\} = \frac{A\, E_0(h)}{(n+2h-2k-1)(n+2h-2k)(n+2h-2k+2)}$$

proving (ii).
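A quick exact check of the lemma is possible in the special case p - k = 1, where the single null coefficient $r_p^2$ is a scalar Beta$\{\tfrac12(q-k), \tfrac12(n-k-q)\}$ variable. The Python sketch below is an illustration only: the function names are hypothetical, h is taken to be a nonnegative integer so that exact rational arithmetic can be used, and the lemma's right-hand sides are assumed to read as in (i) and (ii) with $A = (p-k)(q-k)[(n+2h)\{(p-k)(q-k)+2\} + (p-k)(q-k)(1-2k) - 2p - 2q]$.

```python
from fractions import Fraction


def lemma_i_ratio(n, p, q, k, h):
    """Right side of Lemma 4.3.1(i) divided by E_0(h)."""
    return Fraction((p - k) * (q - k), n - 2 * k + 2 * h)


def lemma_ii_ratio(n, p, q, k, h):
    """Right side of Lemma 4.3.1(ii) divided by E_0(h)."""
    m = (p - k) * (q - k)
    a_const = m * ((n + 2 * h) * (m + 2) + m * (1 - 2 * k) - 2 * p - 2 * q)
    d = n + 2 * h - 2 * k
    return Fraction(a_const, (d - 1) * d * (d + 2))


def beta_tilted_moments(a, b, h):
    """E[X(1-X)^h]/E[(1-X)^h] and E[X^2(1-X)^h]/E[(1-X)^h] for X ~ Beta(a, b).

    Weighting by (1-X)^h turns Beta(a, b) into Beta(a, b + h), so these
    ratios are the first two raw moments of Beta(a, b + h).
    """
    s = a + b + h
    return a / s, a * (a + 1) / (s * (s + 1))


# Special case p - k = 1: r_p^2 ~ Beta((q-k)/2, (n-k-q)/2) under the null.
n, q, k, h = 40, 5, 2, 3
p = k + 1
m1, m2 = beta_tilted_moments(Fraction(q - k, 2), Fraction(n - k - q, 2), h)
print(m1 == lemma_i_ratio(n, p, q, k, h), m2 == lemma_ii_ratio(n, p, q, k, h))
```

Agreement in this one-root case does not prove the general lemma, but it does confirm the denominators and the constant A as transcribed.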


It follows from Lemma 4.3.1 and the fact that $0 \le r_j^2 \le 1$ $(k+1 \le j \le p)$
that

(4.3.12)  $$E_N\Bigl\{ \prod_{i=k+1}^{p} (1-r_i^2)^h \Bigl( 1 - \tfrac{1}{2}\alpha \sum_{j=k+1}^{p} r_j^2 + O(r_j^4) \Bigr) \Bigr\} = E_0(h) \Bigl\{ 1 - \frac{\alpha (p-k)(q-k)}{2(n+2h-2k)} + O(n^{-2}) \Bigr\} .$$

We have used the fact that for some constant K

$$\Bigl| E_N\Bigl\{ \prod_{i=k+1}^{p} (1-r_i^2)^h\, O(r_j^4) \Bigr\} \Bigr| \le K\, E_N\Bigl\{ \prod_{i=k+1}^{p} (1-r_i^2)^h \Bigl( \sum_{i=k+1}^{p} r_i^2 \Bigr)^2 \Bigr\} = O(n^{-2}) .$$


Now  consider  the  integral  in  (4.3.6).  We  have  from  (4.2.4) 


P 2 h 

/ n (l-r  ) 

D i-k+l 


c 1 n 

D i-k+1  1 


P 2 2 k P 2 2- 

x n (r,  -r  ) n n (r  -r  z) 
k+l  1 3 i-i  j-k+i  J 

1<J 
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x n dr. 
iok+l  1 


< C f (i-r  2,l(n-q-p-i+h)  n dr  , 
-S  j 1 K+l  5 i-k+l  1 


since  0 < r.  < l 
— l — 


<-C4  /,  (1-rk+l2)4(n"q'P'1+h)drk+12 

rk 

c Ck  d-r  ")^(n-q-p+1+h) 

— 4 k 


C C _1c  ( l -r  2,4(n-<l-P+l+h) 
LUL6  V rk  ’ 


where $C_6$ is defined by (4.3.5). We can expand $C_6$ using the following
expansion of a gamma function (see, for example, Magnus, et al. (1966),
p. 21)

$$\ln \Gamma(n+c) = (n + c - \tfrac{1}{2}) \ln n - n + \tfrac{1}{2} \ln 2\pi + O(n^{-1})$$

to get

$$C_6 = M n^{\frac{1}{2}(p-k)(q-k)} (1 + O(n^{-1}))$$

where M is a constant which does not depend on n. Then

(4.3.13)  | / cp  n ( l -r  2 ) | = C C ''Mn^'^d-r  2)^{n'q'P+1+h,{l  +0(n-1)} 
D i-k+1  b K 


- C?  0(p 


for  some  g,  , 1 “ rk  < n < l , where  C?  is  defined  in  (4.3.6).  Com- 

bining (4.3.6),  (4.3.12),  and  (4.3.13)  we  have 


$$E_c\Bigl\{ \prod_{i=k+1}^{p} (1-r_i^2)^h \Bigr\} = E_0(h)\, g(h)$$

where

$$g(h) = 1 + \frac{\alpha (p-k)(q-k) h}{(n-2k)(n+2h-2k)} + O(n^{-3}) .$$

From (4.3.1)

$$E_c(T_k) = -\frac{\partial}{\partial h} \bigl[ E_0(h) g(h) \bigr]_{h=0} = -E_0'(0) g(0) - \frac{\alpha (p-k)(q-k)}{(n-2k)^2} + O(n^{-3}) = -E_0'(0) \{ 1 + O(n^{-3}) \} - \frac{\alpha (p-k)(q-k)}{(n-2k)^2} + O(n^{-3}) .$$

Similarly, from (4.3.2)

$$E_c(T_k^2) = \frac{\partial^2}{\partial h^2} \bigl[ E_0(h) g(h) \bigr]_{h=0} = E_0''(0) \{ 1 + O(n^{-3}) \} + \frac{2\alpha (p-k)(q-k)}{(n-2k)^2} E_0'(0) + O(n^{-3}) .$$

Now $-E_0'(0) = E_N(T_k)$ and $E_0''(0) = E_N(T_k^2)$ where $T_k$ is the likelihood
ratio statistic for testing the independence of the two sets of
variates $w_1,\ldots,w_{p-k}$ and $z_1,\ldots,z_{q-k}$ (see (4.3.5)). Bartlett (1938)
proved that

$$E_N(T_k) = \frac{(p-k)(q-k)}{n-2k-\lambda} + O(n^{-3})$$

and

$$E_N(T_k^2) = \frac{(p-k)(q-k)\{ (p-k)(q-k) + 2 \}}{(n-2k-\lambda)^2} + O(n^{-3})$$

where

$$\lambda = \tfrac{1}{2}(p+q-2k+1) .$$

Therefore

$$E_c(T_k) = \frac{(p-k)(q-k)}{n-2k-\lambda} - \frac{\alpha (p-k)(q-k)}{(n-2k)^2} + O(n^{-3}) = \frac{(p-k)(q-k)}{n-2k-\lambda+\alpha} + O(n^{-3})$$

and

$$E_c(T_k^2) = \frac{(p-k)(q-k)\{ (p-k)(q-k) + 2 \}}{(n-2k-\lambda)^2} - \frac{2\alpha (p-k)^2 (q-k)^2}{(n-2k)^2 (n-2k-\lambda)} + O(n^{-3}) = \frac{(p-k)(q-k)\{ (p-k)(q-k) + 2 \}}{(n-2k-\lambda+\alpha)^2} + O(n^{-3}) .$$

Thus the appropriate multiplier of $T_k$ is

$$n - 2k - \lambda + \alpha = n - k - \tfrac{1}{2}(p+q+1) + \sum_{i=1}^{k} r_i^{-2} .$$

Theorem 4.3.1 summarizes the previous discussion and Lawley's (1959)
result.

Theorem 4.3.1--

The statistic

$$L_k = -\Bigl\{ n - k - \tfrac{1}{2}(p+q+1) + \sum_{i=1}^{k} r_i^{-2} \Bigr\} \ln \prod_{i=k+1}^{p} (1 - r_i^2)$$

is approximately distributed as $\chi^2$ on (p-k)(q-k) degrees of freedom.

If the observed values of $r_1^2,\ldots,r_k^2$ are all near one, then the
multiplying factor in $L_k$ is approximately $n - \tfrac{1}{2}(p+q+1)$, which is the
value suggested by Bartlett (1938).
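The statistic of Theorem 4.3.1 is simple to evaluate once the sample coefficients are in hand. The following Python sketch assumes the reading $L_k = -\{n - k - \tfrac12(p+q+1) + \sum_{i \le k} r_i^{-2}\}\ln\prod_{i>k}(1-r_i^2)$; the function name `lawley_statistic` and the sample values in the usage lines are illustrative assumptions, and the reference $\chi^2$ quantile is left to whatever table or library the reader prefers.

```python
import math


def lawley_statistic(r_sq, n, q, k):
    """Bartlett-Lawley statistic L_k of Theorem 4.3.1.

    r_sq : all p squared sample canonical correlations, largest first
    n    : sample size minus one
    q    : size of the larger variate set (p = len(r_sq) <= q)
    k    : number of population coefficients assumed nonzero
    Returns (L_k, degrees of freedom).
    """
    p = len(r_sq)
    alpha = sum(1.0 / r2 for r2 in r_sq[:k])            # sum of r_i^{-2}, i <= k
    multiplier = n - k - 0.5 * (p + q + 1) + alpha
    t_k = -sum(math.log(1.0 - r2) for r2 in r_sq[k:])   # T_k = -ln prod(1 - r_i^2)
    return multiplier * t_k, (p - k) * (q - k)


# Hypothetical data: p = q = 3, n = 100, test whether the last coefficient is zero.
stat, df = lawley_statistic([0.3969, 0.3136, 0.0081], n=100, q=3, k=2)
print(round(stat, 3), df)
```

With k = 0 the sum over $r_i^{-2}$ is empty and the multiplier reduces to $n - \tfrac12(p+q+1)$, Bartlett's factor for the test that all coefficients vanish.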


4.4.  Maximum  Likelihood  Estimates. 

Anderson (1965), James (1966), and Chikuse (1974) have noted that,
in the case of the latent roots of a covariance matrix, estimates of
population latent roots obtained by maximizing the likelihood based on the
sampling distribution of the sample roots provide a correction for bias.


{n-£(p+q-l  )) 


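Setting

$$\frac{\partial \ln L}{\partial \rho_i}$$

equal to zero and solving we obtain the maximum marginal likelihood estimate
$\hat\rho_i$ of $\rho_i$ as

(4.4.2)  $$\hat\rho_i = r_i - \frac{(1-r_i^2)}{2nr_i} \Bigl\{ p + q - 2 + r_i^2 + 2(1-r_i^2) \sum_{j=1,\, j \ne i}^{k} \frac{r_j^2}{r_i^2 - r_j^2} \Bigr\} + O_p(n^{-2}) .$$

It can be shown, using (4.2.5) and (4.2.6), that

(4.4.3)  $$E(\hat\rho_i) = \rho_i - \frac{\rho_i (1-\rho_i^2)}{n} + O(n^{-2})$$

and

(4.4.4)  $$\mathrm{Var}(\hat\rho_i) = \frac{(1-\rho_i^2)^2}{n} + O(n^{-2}) .$$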


These results are similar to those obtained in the case of a simple
correlation coefficient. Let r be the maximum marginal likelihood
estimate based on a sample of size n + 1 from (x, y), where x and
y have a bivariate normal distribution with correlation $\rho$. Then

$$E(r) = \rho - \frac{\rho (1-\rho^2)}{n} + O(n^{-2})$$

and

$$\mathrm{Var}(r) = \frac{(1-\rho^2)^2}{n} + O(n^{-2}) .$$

Fisher's z-transform applied to r stabilizes the mean and variance.
That is, if

$$\xi = \tanh^{-1} \rho = \tfrac{1}{2} \ln \frac{1+\rho}{1-\rho}$$

and

$$\hat\xi = \tanh^{-1} r = \tfrac{1}{2} \ln \frac{1+r}{1-r}$$

then

$$E(\hat\xi) = \xi + O(n^{-2})$$

and

$$\mathrm{Var}(\hat\xi) = n^{-1} + O(n^{-2}) .$$
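Why the transform removes the order $n^{-1}$ bias can be seen from a second-order Taylor (delta-method) expansion. The sketch below assumes the moments $E(r) = \rho - \rho(1-\rho^2)/n$ and $\mathrm{Var}(r) = (1-\rho^2)^2/n$, each up to $O(n^{-2})$, and writes $f(r) = \tanh^{-1} r$.

```latex
\begin{aligned}
f'(\rho) &= \frac{1}{1-\rho^2}, \qquad f''(\rho) = \frac{2\rho}{(1-\rho^2)^2}, \\
E\{f(r)\} &\approx f(\rho) + f'(\rho)\,E(r-\rho) + \tfrac12 f''(\rho)\,E(r-\rho)^2 \\
          &= \xi - \frac{1}{1-\rho^2}\cdot\frac{\rho(1-\rho^2)}{n}
             + \frac12\cdot\frac{2\rho}{(1-\rho^2)^2}\cdot\frac{(1-\rho^2)^2}{n}
           = \xi - \frac{\rho}{n} + \frac{\rho}{n} = \xi, \\
\mathrm{Var}\{f(r)\} &\approx \{f'(\rho)\}^2\,\mathrm{Var}(r) = \frac{1}{n}.
\end{aligned}
```

The cancellation depends on the bias being $-\rho(1-\rho^2)/n$; with a bias of half that size, a term $\rho/(2n)$ would remain in the mean.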

This result suggests applying Fisher's z transformation in the
canonical correlation case. Let

$$\xi_i = \tanh^{-1} \rho_i = \tfrac{1}{2} \ln \frac{1+\rho_i}{1-\rho_i} , \qquad i = 1,\ldots,k$$

and

$$z_i = \tanh^{-1} r_i = \tfrac{1}{2} \ln \frac{1+r_i}{1-r_i} , \qquad i = 1,\ldots,k$$

where $z_i$ is the usual maximum likelihood estimate of $\xi_i$. Lawley
(1959) notes that this transformation fails to stabilize the mean and
variance to any marked extent. In fact $z_i$ has a bias term of order $n^{-1}$.
Let

$$\hat\xi_i = \tanh^{-1} \hat\rho_i = \tfrac{1}{2} \ln \frac{1+\hat\rho_i}{1-\hat\rho_i} , \qquad i = 1,\ldots,k .$$

From (4.4.2) we have

(4.4.5)  $$\hat\xi_i = z_i - \frac{1}{2nr_i} \Bigl\{ p + q - 2 + r_i^2 + 2(1-r_i^2) \sum_{j=1,\, j \ne i}^{k} \frac{r_j^2}{r_i^2 - r_j^2} \Bigr\} + O_p(n^{-2}) .$$

Using (4.4.3) and (4.4.4) we can show that

$$E(\hat\xi_i) = \xi_i + O(n^{-2})$$

and

$$\mathrm{Var}(\hat\xi_i) = n^{-1} + O(n^{-2}) .$$
The effect of Fisher's z-transformation is to provide the same type of bias
reduction and variance stabilization as in the correlation coefficient
case.

As a numerical example, suppose that p = q = 3, n = 101 and that
the sample values are

$r_1 = .630$,  $r_2 = .560$,  $r_3 = .090$,

$z_1 = .741$,  $z_2 = .633$,  $z_3 = .090$.

From (4.4.5) the maximum marginal likelihood estimates, ignoring terms of
order $n^{-2}$, are

$\hat\xi_1 = z_1 - .0682 = .673$

$\hat\xi_2 = z_2 + .0198 = .653$

$\hat\xi_3 = z_3 + .0030 = .093$.

An estimate of the standard deviation of $\hat\xi_i$ is $s = n^{-1/2} \approx .1$. The difference
between $\hat\xi_1$ and $z_1$ is about .7s, and is not trivial. It should also
be noted that $\hat\xi_1 < z_1$ and $\hat\xi_2 > z_2$, and hence the effect of L in
providing a bias correction is to move the estimates closer together.
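The correction in (4.4.5) can be sketched in a few lines of Python. This is an illustration under stated assumptions: (4.4.5) is read as $\hat\xi_i = z_i - (2nr_i)^{-1}\{p+q-2+r_i^2+2(1-r_i^2)\sum_{j\ne i} r_j^2/(r_i^2-r_j^2)\}$, all three sample coefficients are treated as corresponding to nonzero population values, and the function name `corrected_fisher_z` is hypothetical. Because the inputs are rounded to three figures and the bracket is sensitive to $r_i^2 - r_j^2$, the sketch is used here only to show the direction of each correction, not to reproduce the printed digits.

```python
import math


def corrected_fisher_z(r, n, p, q):
    """Return (z_i, xi_hat_i) pairs from (4.4.5), dropping O(n^-2) terms.

    r : the k sample canonical correlations (not squared), largest first,
        treated as corresponding to nonzero population coefficients
    """
    out = []
    for i, ri in enumerate(r):
        ri2 = ri * ri
        # linkage sum over the other coefficients, j != i
        link = sum(rj * rj / (ri2 - rj * rj)
                   for j, rj in enumerate(r) if j != i)
        bracket = p + q - 2 + ri2 + 2.0 * (1.0 - ri2) * link
        z = math.atanh(ri)                       # Fisher's z of the sample value
        out.append((z, z - bracket / (2.0 * n * ri)))
    return out


for z, xi_hat in corrected_fisher_z([0.630, 0.560, 0.090], n=101, p=3, q=3):
    print(round(z, 3), "lowered" if xi_hat < z else "raised")
```

The largest coefficient is pulled down and the second is pushed up, which is the "closer together" effect described in the text.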

One  should  note  that  the  validity  of  (4.4.5)  depends  in  large  part 
on  the  spread  of  the  sample  roots.  If  the  sample  roots  are  close  together, 
and  this  will  happen  as  p gets  large,  then  will  get  large  due  to 

the  terms  of  order  n-1  involving  (r^2-r  2)_1  . 

In the likelihood function of a particular coefficient or set of coefficients the remaining coefficients are nuisance parameters. One can eliminate their effect, at least asymptotically, by deriving the marginal likelihood function from the conditional distribution of the sample coefficients corresponding to those of interest in the population, conditional on the coefficients corresponding to the nuisance parameters, as in Section 2. An alternative approach, suggested by James (1966) in the case of the latent roots of a covariance matrix, is to average the joint likelihood function given in Corollary 4.2.1 over some reasonable posterior distribution of the nuisance parameters. James argued that if these nuisance parameters are somewhat removed from the parameters of interest then the likelihood obtained by this averaging would not be very different from that obtained from the joint likelihood by replacing the nuisance parameters by their maximum likelihood estimates. The log-likelihood of a single population coefficient $\rho_i$ would be, from (4.4.1),

$K+\tfrac12 n\ln(1-\rho_i^2)+\{\tfrac12(p+q-1)-n\}\ln(1-r_i\rho_i)+\{k-\tfrac12(p+q)\}\ln\rho_i-\tfrac12\sum_{j=1}^{i-1}\ln\Bigl(1-\dfrac{\rho_i^2}{\rho_j^2}\Bigr)-\tfrac12\sum_{j=i+1}^{k}\ln\Bigl(\dfrac{\rho_i^2}{\rho_j^2}-1\Bigr)$

where $K$ does not depend on $\rho_i$. The log likelihood of a group of population coefficients would be obtained from (4.4.1) in a similar manner.


CHAPTER  5 


AN ASYMPTOTIC EXPANSION OF ${}_1F_1(\tfrac12(n_1+n_2);\tfrac12 n_1;\tfrac12 n_2\Theta,L)$ FOR LARGE $n_2$


5.1.  Introduction 


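Let ${}_1F_1(n_2,\Theta,L)$ denote the hypergeometric function ${}_1F_1(\tfrac12(n_1+n_2);\tfrac12 n_1;\tfrac12 n_2\Theta,L)$, where $L=\mathrm{diag}(\ell_1,\ldots,\ell_p)$ with $1>\ell_1>\ell_2>\cdots>\ell_p>0$ and $\Theta=\mathrm{diag}(\theta_1,\ldots,\theta_p)$ with $\theta_1\ge\theta_2\ge\cdots\ge\theta_p\ge 0$. This hypergeometric function occurs in the pdf of latent roots which arise in MANOVA and discriminant analysis (see Chapter 1, Section 4). Assume that $\Theta=\mathrm{diag}(\Theta_1,0)$, where $\Theta_1=\mathrm{diag}(\theta_1,\ldots,\theta_k)$ with $\theta_1>\theta_2>\cdots>\theta_k>0$. The technique presented in Chapter 2 is used to derive an asymptotic expansion of ${}_1F_1(n_2,\Theta,L)$ for large $n_2$.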


5.2. An Asymptotic Representation of ${}_1F_1(n_2,\Theta,L)$

An asymptotic representation of ${}_1F_1(n_2,\Theta,L)$ will be obtained using Theorem 2.3.1. The derivation is essentially the same as the derivation of the asymptotic representation of ${}_2F_1(n,P^2,R^2)$ which was presented in Chapter 3, Section 2. The reader is referred to that section for the details of this derivation.


We begin by applying the results of Chapter 1, Section 2 to express ${}_1F_1(n_2,\Theta,L)$ as a multiple integral. From (1.2.6) we have

${}_1F_1(n_2,\Theta,L)=\int_{O(p)}{}_1F_1(\tfrac12(n_1+n_2);\tfrac12 n_1;\tfrac12 n_2\Theta H'LH)\,(dH).$

Partition $H$ into two submatrices $H_1$ and $H_2$ consisting of the first $k$ and last $p-k$ columns, respectively. It follows from (1.2.5) and the fact that $\Theta=\mathrm{diag}(\Theta_1,0)$ that

${}_1F_1(n_2,\Theta,L)=\int_{O(p)}{}_1F_1(\tfrac12(n_1+n_2);\tfrac12 n_1;\tfrac12 n_2\Theta_1H_1'LH_1)\,(dH).$


Because the integrand does not depend on $H_2$, we can integrate over $H_2$ using Lemma 3.2.1. The result is

${}_1F_1(n_2,\Theta,L)=\int_{V(k,p)}{}_1F_1(\tfrac12(n_1+n_2);\tfrac12 n_1;\tfrac12 n_2\Theta_1H_1'LH_1)\,(dH_1).$

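For $n_1+n_2>k-1$ we may apply the Laplace transform relation (1.2.3) to obtain

${}_1F_1(n_2,\Theta,L)=[\Gamma_k\{\tfrac12(n_1+n_2)\}]^{-1}\int_{V(k,p)}\int_{X>0}\mathrm{etr}(-X)(\det X)^{\frac12(n_1+n_2-k-1)}\,{}_0F_1(\tfrac12 n_1;\tfrac12 n_2X\Theta_1H_1'LH_1)\,(dX)(dH_1)$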


where $X$ is a $k\times k$ positive definite matrix. It follows by applying Bessel's integral (1.2.7) to the ${}_0F_1$ function that

${}_1F_1(n_2,\Theta,L)=[\Gamma_k\{\tfrac12(n_1+n_2)\}]^{-1}\int_{V(k,p)}\int_{X>0}\int_{O(n_1)}\mathrm{etr}(-X)(\det X)^{\frac12(n_1+n_2-k-1)}\,\mathrm{etr}\{2^{1/2}n_2^{1/2}[X^{1/2}\Theta_1^{1/2}H_1'L^{1/2}:0]M_1\}\,(dM)(dX)(dH_1)$

where $M_1$ is the $n_1\times k$ matrix formed by the first $k$ columns of $M\in O(n_1)$ and $0$ in $[X^{1/2}\Theta_1^{1/2}H_1'L^{1/2}:0]$ is the $k\times(n_1-p)$ zero matrix. Since the integrand does not depend on the last $n_1-k$ columns of $M$, we may integrate over them using Lemma 3.2.1 to obtain

${}_1F_1(n_2,\Theta,L)=[\Gamma_k\{\tfrac12(n_1+n_2)\}]^{-1}\int_{V(k,p)}\int_{X>0}\int_{V(k,n_1)}\mathrm{etr}(-X)(\det X)^{\frac12(n_1+n_2-k-1)}\,\mathrm{etr}\{2^{1/2}n_2^{1/2}[X^{1/2}\Theta_1^{1/2}H_1'L^{1/2}:0]M_1\}\,(dM_1)(dX)(dH_1).$

Let $X=\tfrac12 n_2G'V^2G$ where $V=\mathrm{diag}(v_1,\ldots,v_k)$ with $v_1>v_2>\cdots>v_k>0$, and $G$ is a $k\times k$ orthogonal matrix with positive elements in the first column. The Jacobian of this transformation, $J\{X\to(V,G)\}$, is given by

2 


(in  22k 


J[X  - (V,  G) } - det  V n (v  2-v  2) 

i<  j J 


.(&) 


(see (3.2.3) and the subsequent discussion). Using the fact that the resulting integrand is invariant under transformations which change the sign of the elements in one row of $G$, we have

(5.2.1)    ${}_1F_1(n_2,\Theta,L)=C_{n_2}\int_A\eta(y)\{g(y,\theta,\ell)\}^{n_2}\,dy$

where


ik(n  +n  ) i 2 

(K)  2 


2 V^W^k1^5 

$\eta(y)=(\det V)^{n_1-k}\prod_{i<j}^{k}(v_i^2-v_j^2),$

$g(y,\theta,\ell)=\mathrm{etr}\{-\tfrac12V^2+[G'VG\Theta_1^{1/2}H_1'L^{1/2}:0]M_1\}\det V,$

$A=V(k,p)\times O(k)\times D_V\times V(k,n_1),$

$D_V=\{(v_1,\ldots,v_k):v_1>v_2>\cdots>v_k>0\},$

$\theta=(\theta_1,\ldots,\theta_k),$

$\ell=(\ell_1,\ldots,\ell_p),$

and $y$ is a point in $A$. Finally make the transformations $G\to G$, $H_1\to H_1$, and $M_1G'=E_1$ in (5.2.1) to get

(5.2.2)    ${}_1F_1(n_2,\Theta,L)=C_{n_2}\int_A\eta(y)\{h(y,\theta,\ell)\}^{n_2}\,dy$

where

$h(y,\theta,\ell)=\mathrm{etr}\{-\tfrac12V^2+[VG\Theta_1^{1/2}H_1'L^{1/2}:0]E_1\}\det V.$

We now determine the maximum value of $h$ for fixed $\Theta$ and $L$ and show that the maximum is obtained at a finite number of interior points of $A$. The result is contained in the following lemma.

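Lemma 5.2.1.

Let

(i) $\Theta_1=\mathrm{diag}(\theta_1,\ldots,\theta_k)$ with $\theta_1>\theta_2>\cdots>\theta_k>0$;

(ii) $L=\mathrm{diag}(\ell_1,\ldots,\ell_p)$ with $\ell_1>\ell_2>\cdots>\ell_p>0$;

(iii) $V=\mathrm{diag}(v_1,\ldots,v_k)$ with $v_1>v_2>\cdots>v_k>0$;

(iv) $G=(g_{ij})\in O(k)$ $(1\le i,j\le k)$;

(v) $E_1=(e_{ij})\in V(k,n_1)$ $(1\le i\le n_1,\ 1\le j\le k)$;

(vi) $H_1=(h_{ij})\in V(k,p)$ $(1\le i\le p,\ 1\le j\le k)$; and

(vii) $h=\mathrm{etr}\{-\tfrac12V^2+[VG\Theta_1^{1/2}H_1'L^{1/2}:0]E_1\}\det V$.

Then the maximum value of $h$ for fixed $\Theta_1$ and $L$ is

$2^{-k}e^{-k/2}\prod_{i=1}^{k}\{(\ell_i\theta_i)^{1/2}+(\ell_i\theta_i+4)^{1/2}\}\exp\sum_{i=1}^{k}\{\tfrac14\ell_i\theta_i+\tfrac14(\ell_i\theta_i)^{1/2}(\ell_i\theta_i+4)^{1/2}\}$

and the maximum is obtained if and only if

$v_i=\tfrac12\{(\ell_i\theta_i)^{1/2}+(\ell_i\theta_i+4)^{1/2}\}\quad(1\le i\le k),\qquad G=\mathrm{diag}(\pm1,\ldots,\pm1),$

$H_1=\begin{bmatrix}\mathrm{diag}(\pm1,\ldots,\pm1)\\0\end{bmatrix},\quad\text{and}\quad E_1=\begin{bmatrix}\mathrm{diag}(\pm1,\ldots,\pm1)\\0\end{bmatrix},$

where $G$, $H_1$, and $E_1$ satisfy the following constraints

$g_{ii}h_{ii}e_{ii}=1\quad(1\le i\le k).$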

Proof.

Let $h_1=e^{T}$ where $T=\mathrm{tr}\{[VG\Theta_1^{1/2}H_1'L^{1/2}:0]E_1\}$ and $h_2=\mathrm{etr}(-\tfrac12V^2)\det V$. Then $h=h_1h_2$. Maximizing $T$ is equivalent to maximizing $h_1$ since $h_1$ is a strictly increasing function of $T$. It follows from Corollary 1 of the Appendix that for fixed $V$, $\Theta_1^{1/2}$, and $L^{1/2}$, the maximum value of $T$ is

$\sum_{i=1}^{k}v_i\theta_i^{1/2}\ell_i^{1/2}$

and the maximum is obtained if and only if $G=\mathrm{diag}(\pm1,\ldots,\pm1)$,

$H_1=\begin{bmatrix}\mathrm{diag}(\pm1,\ldots,\pm1)\\0\end{bmatrix},\quad\text{and}\quad E_1=\begin{bmatrix}\mathrm{diag}(\pm1,\ldots,\pm1)\\0\end{bmatrix},$

where $G$, $H_1$, and $E_1$ satisfy the constraints $g_{ii}h_{ii}e_{ii}=1$ $(1\le i\le k)$. Since the location of the maxima does not depend on $V$, we may maximize $h$ in two stages as follows.

$\max_{V,G,H_1,E_1}h=\max_{V}\{h_2(\max_{G,H_1,E_1}h_1)\}.$

The problem of maximizing $h$ thus reduces to maximizing

$g_1(V)=\exp\sum_{i=1}^{k}\{-\tfrac12v_i^2+v_i\theta_i^{1/2}\ell_i^{1/2}+\ln v_i\}=\prod_{i=1}^{k}\exp\{-\tfrac12v_i^2+v_i\theta_i^{1/2}\ell_i^{1/2}+\ln v_i\}.$

Consider the function $g_2(x)$ defined for all $x>0$ by

$g_2(x)=-\tfrac12x^2+x\alpha^{1/2}+\ln x$

with $\alpha>0$. It is easily shown that $g_2$ has an absolute maximum value of

$-\tfrac12-\ln 2+\ln\{\alpha^{1/2}+(\alpha+4)^{1/2}\}+\tfrac14\alpha^{1/2}\{\alpha^{1/2}+(\alpha+4)^{1/2}\}$

which is obtained if and only if $x=\tfrac12\{\alpha^{1/2}+(\alpha+4)^{1/2}\}$. The lemma follows from this result and the fact that

$\tfrac12\{(\ell_1\theta_1)^{1/2}+(\ell_1\theta_1+4)^{1/2}\}>\cdots>\tfrac12\{(\ell_k\theta_k)^{1/2}+(\ell_k\theta_k+4)^{1/2}\}>0.$
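The closed-form maximizer used in the proof of Lemma 5.2.1 can be checked numerically. The sketch below is only an illustration: the function names are hypothetical, and it assumes the reading $g_2(x)=-\tfrac12x^2+x\alpha^{1/2}+\ln x$ with $\alpha=\ell_i\theta_i$. It compares the closed-form maximum with direct evaluation of $g_2$, and assembles the maximum of $h$ as the product of the per-coordinate maxima.

```python
import math

def g2(x, alpha):
    """g2(x) = -x^2/2 + x*sqrt(alpha) + ln(x), defined for x > 0, alpha > 0."""
    return -0.5 * x * x + x * math.sqrt(alpha) + math.log(x)

def argmax_g2(alpha):
    """Root of g2'(x) = -x + sqrt(alpha) + 1/x = 0 with x > 0."""
    return 0.5 * (math.sqrt(alpha) + math.sqrt(alpha + 4.0))

def max_g2(alpha):
    """Closed-form value of g2 at its maximizer."""
    a, b = math.sqrt(alpha), math.sqrt(alpha + 4.0)
    return -0.5 - math.log(2.0) + math.log(a + b) + 0.25 * a * (a + b)

def max_h(theta, ell):
    """Maximum of h: product over i <= k of exp(max g2) with alpha_i = ell_i * theta_i.

    theta holds the k nonzero population roots; only the first k sample
    roots in ell are used (zip stops at the shorter sequence).
    """
    return math.exp(sum(max_g2(l * t) for t, l in zip(theta, ell)))

def max_h_lemma_form(theta, ell):
    """The same maximum written in the product form stated in the lemma."""
    k = len(theta)
    prod, expo = 1.0, 0.0
    for t, l in zip(theta, ell):
        a, b = math.sqrt(l * t), math.sqrt(l * t + 4.0)
        prod *= a + b
        expo += 0.25 * l * t + 0.25 * a * b
    return 2.0 ** (-k) * math.exp(-0.5 * k) * prod * math.exp(expo)
```

For example, `argmax_g2(alpha)` can be compared with the value of `g2` slightly to either side of it to confirm that the stationary point is a maximum.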
By the same argument that led to (3.2.6), $h$ obtains its maximum at $2^{2k}$ interior points of $A$, and $A$ can be partitioned into $2^{2k}$ disjoint sets $A_i$ $(1\le i\le 2^{2k})$ such that each $A_i$ contains exactly one of these points in its interior. Furthermore, $A_i$ is of the form

(5.2.3)    $A_i=M_{i1}\times M_{i2}\times D_V\times M_{i3}$

where $M_{i1}\subset V(k,p)$, $M_{i2}\subset O(k)$, and $M_{i3}\subset V(k,n_1)$.

Now write (5.2.2) as the sum of $2^{2k}$ integrals where each integral is the integral of $C_{n_2}\eta h^{n_2}$ over one of the $A_i$'s. The only part of the integrand which depends on $G$, $H_1$, or $E_1$ is $\lambda^{n_2}$, where $\lambda=\mathrm{etr}\{[VG\Theta_1^{1/2}H_1'L^{1/2}:0]E_1\}$. $\lambda$ is invariant under transformations of the form

$G\to GS_1,\qquad H_1\to\mathrm{diag}(S_2,I_{p-k})H_1S_1,$

and

$E_1\to\mathrm{diag}(S_2,I_{n_1-k})E_1,$

where $S_1$ and $S_2$ are $k\times k$ diagonal matrices with $\pm1$ on the diagonal. The integral over $A_i$ can be transformed by an appropriate choice of $S_1$ and $S_2$ to an integral of the form

(5.2.4)    $J_i=C_{n_2}\int_{\Omega_i}\eta(y)\{h(y,\theta,\ell)\}^{n_2}\,dy.$

$S_1$ and $S_2$ are chosen so that $h$ obtains its maximum at the interior point $\beta(\theta,\ell)$ of $\Omega_i$ defined by

(5.2.5)    $G=I_k,\qquad H_1=\begin{bmatrix}I_k\\0\end{bmatrix},\qquad E_1=\begin{bmatrix}I_k\\0\end{bmatrix},$

and

$v_i=\tfrac12\{(\ell_i\theta_i)^{1/2}+(\ell_i\theta_i+4)^{1/2}\}\quad(1\le i\le k).$

It follows from (5.2.3) that $\Omega_i$ is of the form

(5.2.6)    $\Omega_i=N_{i1}\times N_{i2}\times D_V\times N_{i3}$

where $N_{i1}\subset V(k,p)$, $N_{i2}\subset O(k)$, and $N_{i3}\subset V(k,n_1)$.

Combining these results we have

(5.2.7)    ${}_1F_1(n_2,\Theta,L)=\sum_{i=1}^{2^{2k}}J_i.$

We will derive an asymptotic representation $\varphi_i$ of $J_i$. In fact $\varphi_i=\varphi$ for all $i$, and hence by (5.2.7), an asymptotic representation for ${}_1F_1(n_2,\Theta,L)$ is given by $2^{2k}\varphi$.

We can show, by the same argument that led to (3.2.11), that for each $i$ $(1\le i\le 2^{2k})$ $\Omega_i$ can be partitioned as

$\Omega_i=\Xi\cup(\Omega_i-\Xi).$

All orthogonal matrices in $\Xi$ are proper, $\beta(\theta,\ell)$ is an interior point of $\Xi$ for all $(\theta,\ell)$, and

(5.2.8)    $\Xi=N_1\times N_2\times D_V\times N_3$

where $N_1$, $N_2$, and $N_3$ are neighborhoods of $\begin{bmatrix}I_k\\0\end{bmatrix}\in V(k,p)$, $I_k\in O(k)$, and $\begin{bmatrix}I_k\\0\end{bmatrix}\in V(k,n_1)$, respectively. $J_i$ may be written as

(5.2.9)    $J_i=C_{n_2}(L_1+L_{i2})$

where

(5.2.10)    $L_1=\int_{\Xi}\eta(y)\{h(y,\theta,\ell)\}^{n_2}\,dy$

(5.2.11)    $L_{i2}=\int_{\Omega_i-\Xi}\eta(y)\{h(y,\theta,\ell)\}^{n_2}\,dy.$

$L_1$ and $L_{i2}$ will be treated separately. We will use Theorem 2.3.1 to derive an asymptotic representation of $L_1$, and then we will prove that $L_{i2}$ is asymptotically of lower order of magnitude than $L_1$.

Using the notation of Theorem 2.3.1 let

(5.2.12)    $A=\{(\theta,\ell):\infty>\theta_1>\theta_2>\cdots>\theta_k>0,\ 1>\ell_1>\ell_2>\cdots>\ell_p>0\}$

and for each $\varepsilon$, $0<\varepsilon<\tfrac12$, and $K>0$, let $B=B(K,\varepsilon)$ be the set

(5.2.13)    $B=\{(\theta,\ell)\in A:K>\theta_1,\ \theta_i-\theta_{i+1}>\varepsilon\ (1\le i\le k)$ and $\ell_j-\ell_{j+1}>\varepsilon\ (1\le j\le p)$, where $\theta_{k+1}=\ell_{p+1}=0\}.$

Derivation of an Asymptotic Representation for $L_1$.

$G$, $H_1$, and $E_1$ can be parameterized in $\Xi$ by


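$G=\exp(S),$

$[H_1:-]=\exp(W)=\exp\begin{bmatrix}W_{11}&W_{12}\\-W_{12}'&0\end{bmatrix},$

and

$[E_1:-]=\exp(Z)=\exp\begin{bmatrix}Z_{11}&-Z_{21}'\\Z_{21}&0\end{bmatrix},$

where $S$, $W_{11}$, and $Z_{11}$ are $k\times k$ skew-symmetric matrices, $W_{12}$ is $k\times(p-k)$, and $Z_{21}$ is $(n_1-k)\times k$. The Jacobians of these transformations are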

rk<&>  2 

J(G-S)  - r (1  + 0(8  )}  , 

I'k(tP)  2 

J(H,  - (*„,»„))  - -JTJ-  {1  ♦ 0(«y  I)  , 

2 TT 


rk(^l  ) 2 

J(Ej  - (Zn,Z21)}  - — - U ♦ 0(z4  / )) 


k in,  k 

? ti'1  1 


'U 


and 
(see (3.2.17) - (3.2.20)). In this section the notation $O(s_{ij}^{m})$ means terms in the $s_{ij}$ which are at least of order $m$.
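As a bookkeeping aid, the number of free coordinates in this parameterization can be counted directly: $k$ values $v_i$, three $k\times k$ skew-symmetric matrices ($S$, $W_{11}$, $Z_{11}$), the $k\times(p-k)$ block $W_{12}$ and the $(n_1-k)\times k$ block $Z_{21}$. The sketch below uses hypothetical helper names and simply checks that this count agrees with the closed form $k\{n_1+p-\tfrac12(k+1)\}$, which is the dimension of the integration variable $x$ under that reading of the block sizes.

```python
def coordinate_count(k, p, n1):
    """Count free coordinates of (V, S, W11, W12, Z11, Z21)."""
    v = k                      # diagonal entries v_1, ..., v_k
    skew = k * (k - 1) // 2    # free entries of one k x k skew-symmetric matrix
    w12 = k * (p - k)          # k x (p - k) block
    z21 = (n1 - k) * k         # (n1 - k) x k block
    return v + 3 * skew + w12 + z21

def closed_form(k, p, n1):
    """k * (n1 + p - (k + 1) / 2), written with integer arithmetic."""
    return k * (n1 + p) - k * (k + 1) // 2
```

For instance, `coordinate_count(2, 3, 5)` and `closed_form(2, 3, 5)` both evaluate the dimension for $k=2$, $p=3$, $n_1=5$.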

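The point $\beta(\theta,\ell)$ defined by (5.2.5) is mapped into the point $\xi(\theta,\ell)$ defined by

(5.2.14)    $S=W_{11}=Z_{11}=0,\quad W_{12}=0,\quad Z_{21}=0,$ and $v_i=\tfrac12\{(\ell_i\theta_i)^{1/2}+(\ell_i\theta_i+4)^{1/2}\}\quad(1\le i\le k),$

and $\Xi$ is mapped into a set $D$ which contains $\xi(\theta,\ell)$ in its interior.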

It  follows  after  some  simplification  that 

A , A k A i 

tr(  [VG  GJaH1  L2:0]Ej  } - 3)  v.  9.  ’-t^-  + f 


where 


(5.2.15)  i - - X {§(v.y.?u.s  + v.o,®- C.  ) ( s 


K*  * v.o.V*l(s.  } ♦ w.  2 + 


i i i j 3 j ij  iJ  ij 


1 A A A A A A A 

+ (v.  d.  A.  + v.  9,  s)s.  .w  + (v.  9.  z-t.  + v . 9.  3-C,.  3)s 
i j i jij  ijij  ijj  j i i ' 

A A A A 

+ ( V . J.  2 1 3 + V . d . e-t.  8 )w . . z . . } 

1 1 j J J 1 IJ  IJ 


iJZiJ 


k p 

-XX 
i-i  j-; 


X i.tv-  d.^-t.^fz  +w  2)  + v 0.  a4.  pv  z ] 

l j.k+l  1 1 1 iJ  iJ  ii  J ijiJ 


k °1  A A 2 

•i£  ! Vi  VV 

i-l  j-p+l  1 10 

Combining these results we have

(5.2.16)    $L_1=\int_{D}\tau(x)\{f(x,\theta,\ell)\}^{n_2}\,dx$

where


f(x,  9,4.)  - exp<  X {-£v.2  + 0^^)  + f’v  nVl 
! i-1  . 1-1 
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T ( X ) 


rk(sk)Ik(^p)Ik(^ni  ) k ni'k 

fl  v. 

23knfk(k+p+n1  ) i„i  1 


2 2 
n (v  -v/) 

i<j  J 


X J(G-S)J{H1-(W11,W12)}J(E1-(Z11,Z21 })  , 

$D$ is the image of $\Xi$, and $x\in D$.

An asymptotic representation of $L_1$ will now be obtained by applying Theorem 2.3.1 to (5.2.16). We must verify that the conditions of the theorem are satisfied. A basic difference between the functions ${}_2F_1(n,P^2,R^2)$ considered in Chapter 3 and ${}_1F_1(n_2,\Theta,L)$ is that the population and sample roots are bounded in the former but only the $\ell_i$'s are bounded in the latter. It turns out that the conditions of Theorem 2.3.1 are satisfied by the integral representation (5.2.16) for ${}_1F_1(n_2,\Theta,L)$ only on bounded sets $B$ of $(\theta,\ell)$'s. For this reason we had to introduce the parameter $K$ in the definition (5.2.13) of $B=B(K,\varepsilon)$. $K$ is an upper bound for the $\theta_i$'s.

Verification of the conditions of Theorem 2.3.1.

Conditions (i)-(ix) can be verified by essentially the same arguments that were used in Chapter 3 to verify the corresponding conditions for the integral $L_1$ defined by (3.2.23). In effect we need only verify condition (v) and prove that Lemma 3.2.3, which is used to verify condition (vii), holds for $h(y,\theta,\ell)$ defined by (5.2.2).

(v) We have to show that

$0<\inf_{B}\gamma_k^2(\theta,\ell)\quad\text{and}\quad\sup_{B}\gamma_1^2(\theta,\ell)<\infty$

where $\gamma_1^2(\theta,\ell)\ge\gamma_2^2(\theta,\ell)\ge\cdots\ge\gamma_k^2(\theta,\ell)$ are the latent roots of $\Omega\{\xi(\theta,\ell),\theta,\ell\}$. $\Omega(x,\theta,\ell)=(\omega_{ij}(x,\theta,\ell))$ is the $k\times k$ matrix defined for all $x\in D$ and $(\theta,\ell)\in A$ by

$\omega_{ij}(x,\theta,\ell)=\dfrac{\partial^2\psi}{\partial x_i\,\partial x_j}.$

To compute $\Omega\{\xi(\theta,\ell),\theta,\ell\}$ we need the second partial derivatives of $\psi(x,\theta,\ell)=-\ln f(x,\theta,\ell)$ evaluated at $\{\xi(\theta,\ell),\theta,\ell\}$. It follows from (5.2.15) that for fixed $(\theta,\ell)$ the only non-zero second partial derivatives of $\psi$ evaluated at $\{\xi(\theta,\ell),\theta,\ell\}$ are


$\dfrac{\partial^2\psi}{\partial v_i^2}=\dfrac{2(\theta_i\ell_i+4)^{1/2}}{(\theta_i\ell_i)^{1/2}+(\theta_i\ell_i+4)^{1/2}}\qquad(1\le i\le k),$


,2  , ^2  , .2  -I 

-Mr-il  M,)* 


3s  . dz.  . 

ij  ij  ij 


2 “ •|(^iei)S{('£.i0i)a  + (^9.  +4)^} 


+ ^(Vr  f(V.j)5  + e,+4)S ) (i<i<j<k)  , 


n 
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-^2  - i(9i'4;)^(^i9i)^  + (^3^4)^} 
dz  . 
ij 


(i<i<k,  P+1<j£ni ) • 


It follows from these results and the fact that $(\theta,\ell)\in B$ implies $\varepsilon<\ell_i<1$ $(1\le i\le p)$ and $\varepsilon<\theta_j<K$ $(1\le j\le k)$ that

$\sup_{B}|\omega_{ij}\{\xi(\theta,\ell),\theta,\ell\}|<2(K+4)=C<\infty.$

By Lemma 2.3.1

$\gamma_1^2(\theta,\ell)=\max_{x'x=1}x'\Omega\{\xi(\theta,\ell),\theta,\ell\}x\le\max_{x'x=1}\sum_{i=1}^{k}\sum_{j=1}^{k}|x_i||x_j||\omega_{ij}\{\xi(\theta,\ell),\theta,\ell\}|\le C\max_{x'x=1}\Bigl\{\sum_{i=1}^{k}|x_i|\Bigr\}^2\le kC.$

Hence

$\sup_{B}\gamma_1^2(\theta,\ell)<\infty.$
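The chain of inequalities above says that a symmetric $k\times k$ matrix whose entries are bounded in absolute value by $C$ has quadratic form at most $kC$ on the unit sphere. The following sketch, with hypothetical helper names and randomly generated matrices standing in for $\Omega\{\xi(\theta,\ell),\theta,\ell\}$, illustrates that bound numerically; it is not part of the original argument.

```python
import math
import random

def quadratic_form(omega, x):
    """Return x' * omega * x for a square matrix given as a list of rows."""
    k = len(x)
    return sum(x[i] * omega[i][j] * x[j] for i in range(k) for j in range(k))

def random_unit_vector(k, rng):
    """Draw a vector with x'x = 1 by normalizing Gaussian coordinates."""
    x = [rng.gauss(0.0, 1.0) for _ in range(k)]
    norm = math.sqrt(sum(xi * xi for xi in x))
    return [xi / norm for xi in x]

def random_bounded_symmetric(k, c, rng):
    """Symmetric k x k matrix with every entry in [-c, c]."""
    omega = [[0.0] * k for _ in range(k)]
    for i in range(k):
        for j in range(i, k):
            omega[i][j] = omega[j][i] = rng.uniform(-c, c)
    return omega

def largest_quadratic_form(omega, trials, rng):
    """Largest value of x' * omega * x seen over random unit vectors."""
    k = len(omega)
    return max(quadratic_form(omega, random_unit_vector(k, rng))
               for _ in range(trials))
```

Because $(\sum_i|x_i|)^2\le k\sum_i x_i^2$ by Cauchy's inequality, the value returned by `largest_quadratic_form` never exceeds $kC$ for matrices produced by `random_bounded_symmetric(k, C, rng)`.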

To prove that $\inf_{B}\gamma_k^2(\theta,\ell)>0$ it is sufficient, because of (3.2.25), to prove that $\inf_{B}\Delta\{\xi(\theta,\ell)\}>0$, where $\Delta\{\xi(\theta,\ell)\}$ is the Hessian of $\psi$ as a function of $x$ evaluated at $\xi(\theta,\ell)$.

By interchanging pairs of rows and the corresponding pairs of columns of $\Omega\{\xi(\theta,\ell),\theta,\ell\}$, we can express $\Delta\{\xi(\theta,\ell)\}$ as the determinant of a block diagonal matrix of the form

$\mathrm{diag}(W_1,\ldots,W_k,X_{12},\ldots,X_{k-1,k},Y_{1,k+1},\ldots,Y_{1p},\ldots,Y_{k,k+1},\ldots,Y_{kp},Z_1,\ldots,Z_k)$

where
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w - — -1- 

1 

1 


d2^ 

V 

^ij^ij 

b2t 

d2i| t 

V 

r^t 

b2<l 

dw  2 
ij 

d2(- 

b2$ 

*u‘  J 

(l<i<k)  , 


£±. 


a2^ 


a% 


(l<i<j<k)  , 


and 


Zi  " ^Vi^W^  + (^i9i+4^)In 


All of the derivatives are evaluated at the point $\{\xi(\theta,\ell),\theta,\ell\}$. We then have

$\Delta\{\xi(\theta,\ell)\}=\prod_{i=1}^{k}W_i\prod_{i<j}^{k}\det X_{ij}\prod_{i=1}^{k}\prod_{j=k+1}^{p}\det Y_{ij}\prod_{i=1}^{k}\det Z_i.$

After a considerable amount of simplification $\Delta\{\xi(\theta,\ell)\}$ reduces to

(5.2.17)  a(s(0,^)}  = 2k(k+2_p'ni)  n n [((^.e. 9,+4)^}2(i -4  )9  ] 

i-l  j-k+l  11  11  1 J 1 


k 

n 

i<j 

k 

n 

1 


X*- 


x ^ t(  ^ ) ( 0±- ^ ^ ,s(  e1+4  ^ ] 

fl  Uei-bi)^(ni~P){(ti0i)^+(.(,i9i+4)^)  1 P (-6^ +4)^1  . 


x 
It follows from (5.2.17) and the fact that $(\theta,\ell)\in B$ implies $\theta_i-\theta_{i+1}>\varepsilon$ and $\ell_j-\ell_{j+1}>\varepsilon$ $(1\le i\le k,\ 1\le j\le p)$, where $\theta_{k+1}=\ell_{p+1}=0$, that

$\inf_{B}\Delta\{\xi(\theta,\ell)\}>0.$

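(vii) $\Omega_i$ was defined in (5.2.6) as

$\Omega_i=N_{i1}\times N_{i2}\times D_V\times N_{i3}$

where $N_{i1}\subset V(k,p)$, $N_{i2}\subset O(k)$, and $N_{i3}\subset V(k,n_1)$ are neighborhoods of $\begin{bmatrix}I_k\\0\end{bmatrix}$, $I_k$, and $\begin{bmatrix}I_k\\0\end{bmatrix}$, respectively. If $y\in\Omega_i$ then the components of $y$ are values of $V$, $G$, $H_1$, and $E_1$. Write

$y=(H_1,G,V,E_1)$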

and  let 


We  can  verify  condition  (vii)  by  the  same  argument  that  was  used  in 
Chapter  3 to  verify  condition  (vii)  once  we  have  proved  the  following 
lemma. 

Lemma 5.2.2.

For every $\delta>0$ there exists a constant $\mu=\mu(\delta)$, $0<\mu<1$, such that if $y\in\Omega_i$ for some $i$ $(1\le i\le 2^{2k})$ and $d(y_\Delta,\beta(\theta,\ell))>\delta$ for some $(\theta,\ell)\in B$, then

$|h(y,\theta,\ell)/h\{\beta(\theta,\ell),\theta,\ell\}|<\mu.$
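Proof.

Let $t_j=t_j(\theta,\ell)=\tfrac12\{(\theta_j\ell_j)^{1/2}+(\theta_j\ell_j+4)^{1/2}\}$ $(1\le j\le k)$. It follows from the definitions of $y_\Delta$ and $\beta(\theta,\ell)$ that

$d(y_\Delta,\beta(\theta,\ell))=\Bigl\{\sum_{j=1}^{k}(v_j-t_j)^2\Bigr\}^{1/2}.$

That there exists a $J$ $(1\le J\le k)$ such that $|v_J-t_J|>\delta k^{-1/2}$ is implied by $d(y_\Delta,\beta(\theta,\ell))>\delta$. From Lemma 5.2.1 and Corollary 1 of the Appendix

$\left|\dfrac{h(y,\theta,\ell)}{h\{\beta(\theta,\ell),\theta,\ell\}}\right|\le\sup_{G,H_1,E_1}\dfrac{h(y,\theta,\ell)}{h\{\beta(\theta,\ell),\theta,\ell\}}=\exp\sum_{i=1}^{k}\{-\tfrac12v_i^2+(\theta_i\ell_i)^{1/2}v_i+\ln v_i-\ln t_i+\tfrac12-\tfrac12(\theta_i\ell_i)^{1/2}t_i\}\le\exp[g_1(v_J,\theta_J,\ell_J)]$

where

$g_1(v_J,\theta_J,\ell_J)=-\tfrac12v_J^2+(\theta_J\ell_J)^{1/2}v_J+\ln v_J-\ln t_J+\tfrac12-\tfrac12(\theta_J\ell_J)^{1/2}t_J.$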
By  Cauchy's  inequality 


, , -1  v -l,  2 2 

VJ  " VJtJtJ  i (VJ  +tJ  5 * 


Therefore  gj  (Vj,  Qjt  < gg  (Vj,  0 , 4 ) where 

«2{V  W - “V(1  - V1(  + 111  VJ  - m tj  A 

- + ln{vjVl)  + * • 

To prove the lemma it suffices to prove that

$\sup g_2(v_J,\theta_J,\ell_J)=\nu<0.$

The lemma will then follow by taking $\mu=e^{\nu}$. Since $|v_J-t_J|>\delta k^{-1/2}$ we can write

$v_J=t_J+d$

where $|d|>\delta k^{-1/2}$ and $t_J+d>0$. Make this change of variables in $g_2$. After simplifying we have

$g_2(v_J,\theta_J,\ell_J)=g_3(dt_J^{-1})$

where

$g_3(t)=-\tfrac12t^2-t+\ln(1+t),\qquad t>-1.$

It is easily shown that $g_3$ has a maximum value of $0$ which is obtained at $t=0$, $g_3'(t)>0$ $(-1<t<0)$, and $g_3'(t)<0$ $(t>0)$. Since $t_J+d>0$ we have $dt_J^{-1}>-1$. If $(\theta,\ell)\in B$ then $\varepsilon<\theta_J<K$, $\varepsilon<\ell_J<1$, and therefore

$t_J^{-1}>2\{K^{1/2}+(K+4)^{1/2}\}^{-1}=\lambda>0.$

Combining these results we have

$\sup g_2(v_J,\theta_J,\ell_J)=\sup g_3(dt_J^{-1})\le\max\{g_3(\delta k^{-1/2}\lambda),\,g_3(-\delta k^{-1/2}\lambda)\}<g_3(0)=0,$

proving the lemma.
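The behaviour of $g_3$ and the resulting constant $\mu$ can be illustrated numerically. This is a sketch only: it assumes the reading $g_3(t)=-\tfrac12t^2-t+\ln(1+t)$, the function names are hypothetical, and the left-hand branch is skipped when $\delta k^{-1/2}\lambda\ge 1$ because $dt_J^{-1}>-1$ rules it out.

```python
import math

def g3(t):
    """g3(t) = -t^2/2 - t + ln(1 + t), defined for t > -1."""
    return -0.5 * t * t - t + math.log1p(t)

def g3_prime(t):
    """Derivative of g3: -t - 1 + 1/(1 + t) = -t(t + 2)/(1 + t)."""
    return -t - 1.0 + 1.0 / (1.0 + t)

def ratio_bound(delta, k, big_k):
    """Return mu = exp(nu), nu = max{g3(u), g3(-u)} with u = delta * lam / sqrt(k).

    lam = 2 / (sqrt(K) + sqrt(K + 4)) is the lower bound for 1/t_J on B(K, eps).
    """
    lam = 2.0 / (math.sqrt(big_k) + math.sqrt(big_k + 4.0))
    u = delta * lam / math.sqrt(k)
    right = g3(u)
    # g3 is only defined for arguments greater than -1.
    left = g3(-u) if u < 1.0 else float("-inf")
    return math.exp(max(right, left))
```

Since $g_3(0)=0$ is the strict maximum, `ratio_bound` returns a value strictly between 0 and 1 for any $\delta>0$, and the bound moves toward 1 as $\delta$ shrinks or $K$ grows.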

The argument used in Chapter 3 may now be applied. Summarizing we have

Lemma 5.2.3.

For every $\delta>0$ there exists a constant $\mu=\mu(\delta)$, $0<\mu<1$, such that


h(y,  e,  i)  j 

l),  S,£]|  < u 

frr  all  y c ^ - s{  ,0(0,1;!  , ( 6,  i.)  c B , and  i (l<i<?2k)  . 

Since all of the conditions of Theorem 2.3.1 are satisfied we have

(5.2.18)    $L_1\sim\tau\{\xi(\theta,\ell)\}[f\{\xi(\theta,\ell),\theta,\ell\}]^{n_2}[\Delta\{\xi(\theta,\ell)\}]^{-1/2}(2\pi/n_2)^{\frac12k\{n_1+p-\frac12(k+1)\}}$

uniformly in $(\theta,\ell)\in B$. Using (5.2.17), (5.2.18) reduces to
(5.2.19)  l.  ~c  n C(  j.-e.)(i.  -0.)}’*  n a ( % ( t.-o,  »)’* 

f i<.1  “ 1 - i-l  J-k-u  1 1 3 


x !!  L(  ’ 0.  -6.  ) *->■  ( Q C +«  ) *•) 
, , l i 11. 


»_  n +&«>,-!•  ♦*)  - r (p-n  )/u 

{41  V4)  (CiV 


where 


i 1 k r A A.  A 1 

* ^>rt ! IT  n2 

L i-l  ' 


k ;Skf  2p-2n  -Sk-i5/2) 

)e  2 


t)K  ( 3k*-l  )/H  ^ 


|Stln,  +p->k-£} 


Asymptotic behavior of $L_{i2}$.

By exactly the same argument that was used in Chapter 3, we can show


• n+n  -k  Mn,  +p-$(k+l  }) 

IL,  A,!  < .*  n.  t 

ior  all  y t - u , i >'l<i<2  ) , and  ( U,  -t)  S B , where  v is  a 

constant,  u is  a constant,  o < u < l , such  that 

|h(y,  U,  ^)/Ti  ( 3(  i»  t),  t), j < ti  , arid  s giver  by  the  rtghthand  side 

of (5.2.18). Therefore $L_{i2}$ is asymptotically of lower order of magnitude than $L_1$.


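Theorem 5.2.1.

Let

(i) $L=\mathrm{diag}(\ell_1,\ldots,\ell_p)$ with $1>\ell_1>\cdots>\ell_p>0$;

(ii) $\Theta=\mathrm{diag}(\theta_1,\ldots,\theta_p)$ with $\theta_1>\theta_2>\cdots>\theta_k>\theta_{k+1}=\cdots=\theta_p=0$;

(iii) $A=\{(\theta,\ell):1>\ell_1>\cdots>\ell_p>0;\ \theta_1>\theta_2>\cdots>\theta_k>0\}$ where $\theta=(\theta_1,\ldots,\theta_k)$ and $\ell=(\ell_1,\ldots,\ell_p)$; and

(iv) for every $\varepsilon$, $0<\varepsilon<\tfrac12$, and $K>0$, $B=B(K,\varepsilon)=\{(\theta,\ell)\in A:K>\theta_1,\ \theta_i-\theta_{i+1}>\varepsilon\ (1\le i\le k)$ and $\ell_j-\ell_{j+1}>\varepsilon\ (1\le j\le p)\}$ where $\theta_{k+1}=\ell_{p+1}=0$.

Then for all $(\theta,\ell)\in A$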

k P _i 

(5.2.  ) . ' . ' ' *n  , •:  : ■ • ■ ) 

il  A * - 1 d O 

1 *»  * , 1 « 1 


i; n2 


x u!(0ri)s‘(e.v«r) 


n,+4(n  -p+i ! 


(p-n  )A 


SV"1  (Vi 


] 


x e:op 


in  v 

4 2 . 

X«*  l 


w f*r< 


K ** 
I 


W ,‘k(^p)n; 


t»2-'  +£H£k  , jk(  2p-*k-3n2  -n.  - ?/z  } -&V* 


V. 


*.  n.  +n„  ;]  n 


ki  k-1 1 ; ''4 


Furthermore (5.2.20) holds uniformly in $(\theta,\ell)\in B$ for every $\varepsilon$ and $K$.


5.3. The Term of Order $n_2^{-1}$ in the Asymptotic Expansion of ${}_1F_1(\tfrac12(n_1+n_2);\tfrac12 n_1;\tfrac12 n_2\Theta,L)$.

As  indicated  ir  '""Miter  Section  4,  irther  terms  <r  an  asymptotic 

expansion of ${}_1F_1(n_2,\Theta,L)$ could be obtained, at least in principle, by a more careful analysis of the integrals in Section 5.2. Such an analysis would lead to an asymptotic expansion of the form

${}_1F_1(n_2,\Theta,L)\sim\varphi G$

where $\varphi$ is the asymptotic representation for ${}_1F_1(n_2,\Theta,L)$ given by Theorem 5.2.1,

$G=1+\dfrac{P_1}{n_2}+\dfrac{P_2}{n_2^2}+\cdots,$

and the $P_i$'s are functions of the $\theta_i$'s and $\ell_i$'s but do not depend on $n_2$.
Suppose  that  S is  a p x p symmetric  matrix  with  latent  roots 
sj>-  ’>Sp  ’ ^ = diagCT^O)  is  a p x p matrix,  and  T is  a k x k 
symmetric  matrix  with  latent  roots  t , . .,t  . Then  by  Theorem  3.3.2, 

jFjtajcjTjS)  satisfies  the  following  partial  differential  equation 


£ d2p  P P si  dv  , P 

2 s.  — At  + 2 2 i 2L_  + (c  _ ^(p_i  , i z 

1-1  1 = 1 ,]  = 1 i j X i- 

i/j 


y dF 

,Z  '557 

1=1  i 


- 2t: 


aF  2 t.  . 


Let  S = L , T = , a = tlfn^+n^)  , and  c = . It  follows  after 

seme  simplification  that  F^  («*,  L)  satisfies 


(5.3.  1 ) 


V ■(  d 2 F ^ v p 1 dF  if  2 dF 

,S.  i 77?  S S 7— TT7  517  + ^(nrP+1)  .Z  5*7 

i=l  d-t^  i-1  j = i i j i i.i  i 


k 2 dF  n?(ni+n?)  k 
i=l  l 1=1 

Substituting $\varphi G$ for $F$ in (5.3.1) we have after a considerable amount of computation that $G$ satisfies


P A*n  P P v 

£ t.  £ji-  ♦ £ £ ± iS 

i.l  1 bt  ? i-k+i  <-k+l  P.  - dt 

i/j 


4 , 

-- — - -i!r,  -r-i  I 2 


i If 

i-k+1  ck< 


k j .i,  - 

h £ (->*)*{  (tfp  ♦ (t  i +4)^}  -- 

2 j . 1 1 11  * 1 


i 

A -i 


K 

4r  V 

1 (n  -p  ){-£■.  9. 

| - - - ----  --  i 

(v .d.  )''(  v S3.  + 4 ) + t. 

1 i i i i 

°G 

<c  «£• 

, , i 

t 8 4 

d-C. 

L <Vi*4! 

^ * 

i 

V a ? -C< 


k (p-i  ) : 


. ^ v 9 * _±_  - - I L , — *- 

' L-i  ■ <3,.  •*  j i-l  j S?:iV4' 


X }.  X 

; - ;■ 

i 1 X 1 11  i.  1 J X I 


. 


(W^ 


-4  M ! 


k ^ . k p >t . + t ~1 

- £ £ i — - - £ £ -1 jL-j 

i/j  ( - .-C'  | )2  l-i  j-k+ 1 (•- , - 1 . ) j 


The differential equation for $P_1$ is obtained by substituting

$1+\dfrac{P_1}{n_2}+\dfrac{P_2}{n_2^2}+\cdots$

for $G$ in the previous differential equation and equating coefficients of powers of $n_2^{-1}$. The resulting equation for $P_1$ is

(5.5.2) 


k i 1 bp  k dp 

,sfV t(W>‘  * 'Vi1  1 j - * V r1 

i«l  o<-.  i»!  o« 


f k J'  (p-n  )(p-n,  + 2)  (?-n.  )d  {(S  : - (<.  9 )*} 

fp  | £ L : ♦ i 1 l—L i 1 

'li-H  VW4)  (t1V,v* 


e,(<V«  )V,V*r  * 1 - i ,a  1 ",  k c 

A 4 4 .1  J 1 > yn  Y * 

(tiV4>2  j Uj 


t is  evaluated  us inf;  t e boundary  conditions.  If  we  can  find  a Q which 

Exarai nal i on  of  (5.3  « 

shows that a function satisfying (5.3.3) must be identically constant. Since there is only one boundary condition for $P_1$, there is no way to determine the constant.
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A particular  solution  of  (5.3.2)  is 
(5.3.5)  Q 


k 

Z 

i-l 


(p-nj  )2(-t'i«i+2)  (p-nj  )((^ai)~  - (^.e.+4)?) 


8 (^  0,  )“(-£.,  d.+4  ) 
i i ' i i ' 


* 


2(^;d.)?(4.id.+4) 


6 (-6  0 )^  - (-L.0.)3/2  - 12(-L..0.+4  )^ 
i i i i ' ii  ' 


(i,  0.  +4 ) 

i l 


3/2 


k 

+ Z Z 
i<j 


i A.  A \ 

(410i)^(i  0j+4)-  + (*t  e ) 5('ti9i+4)‘ 


('£'1-'C'j)(0i-9J){(^16i)^  + ('Li9.+4)^}{(^j0J)^  + (-^0>4)*) 


( l. 0.  1 
11 


k p 

+ Z Z 

i"i  j"k+1  0.  (■&,  -<-  ■){  (-t»0.  )*  + (-L-  0 +4  )^] 

l i o i i ii  11 

Note  that  Q(W,  L)  = Q(L,  ^)  . Hence  the  general  solution  of  (5.3.2)  is 

P,  - Q + V 


where  Q is  given  by  (5.3.5)  and 


is  a constant. 


Additional $P_i$'s could be calculated by the same technique but the partial differential equations for $P_i$ are very complicated when $i\ge 2$.


CHAPTER 6


MANOVA AND DISCRIMINANT ANALYSIS

6.1. Introduction.

Suppose $W$ and $B$ are independent $p\times p$ random matrices such that $W$ has a Wishart distribution on $n_2$ $(n_2\ge p)$ degrees of freedom, $W_p(n_2,\Sigma)$, and $B$ has a noncentral Wishart distribution on $n_1$ $(n_1\ge p)$ degrees of freedom and noncentrality matrix $\Sigma^{-1}A$, $W_p(n_1,\Sigma,\Sigma^{-1}A)$. Let $\Omega=\mathrm{diag}(\omega_1,\ldots,\omega_p)$, $\omega_1\ge\omega_2\ge\cdots\ge\omega_p\ge 0$, be the diagonal matrix of the latent roots of $\Sigma^{-1}A$ and let $L=\mathrm{diag}(\ell_1,\ldots,\ell_p)$, $1>\ell_1>\cdots>\ell_p>0$, be the diagonal matrix of the latent roots of $B(B+W)^{-1}$. In MANOVA and discriminant analysis $B$ and $W$ are respectively the "between groups" and "within groups" matrices of sums of squares and sums of products (see Chapter 1, Section 4). Finally assume that $\Omega=n_2\Theta$, where $\Theta=\mathrm{diag}(\theta_1,\ldots,\theta_p)$ with $\theta_1\ge\theta_2\ge\cdots\ge\theta_p\ge 0$.

The joint pdf of $\ell_1,\ldots,\ell_p$ is (see (1.4.1))
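(6.1.1)    $f(L)=\mathrm{etr}(-\tfrac12 n_2\Theta)\,{}_1F_1(\tfrac12(n_1+n_2);\tfrac12 n_1;\tfrac12 n_2\Theta,L)\times C_1\prod_{i=1}^{p}\{\ell_i^{\frac12(n_1-p-1)}(1-\ell_i)^{\frac12(n_2-p-1)}\}\prod_{i<j}^{p}(\ell_i-\ell_j)\prod_{i=1}^{p}d\ell_i$

where

$C_1=\dfrac{\pi^{\frac12p^2}\,\Gamma_p\{\tfrac12(n_1+n_2)\}}{\Gamma_p(\tfrac12 n_1)\,\Gamma_p(\tfrac12 n_2)\,\Gamma_p(\tfrac12 p)}.$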

Since the distribution of $\ell_1,\ldots,\ell_p$ depends only on $\theta_1,\ldots,\theta_p$, that part of the pdf which involves $\theta_1,\ldots,\theta_p$ can be regarded as a marginal likelihood $\mathcal{L}$. Then

(6.1.2)    $\mathcal{L}=\mathrm{etr}(-\tfrac12 n_2\Theta)\,{}_1F_1\{\tfrac12(n_1+n_2);\tfrac12 n_1;\tfrac12 n_2\Theta,L\}.$

The results of Chapter 5 are used to derive an asymptotic expansion for large $n_2$ of the pdf (6.1.1) and an asymptotic representation for large $n_2$ of the likelihood function (6.1.2) under the assumption that the $\theta_i$'s satisfy

(6.1.3)    $\theta_1>\theta_2>\cdots>\theta_k>\theta_{k+1}=\cdots=\theta_p=0.$

These results are then used to study Bartlett's test that the residual $p-k$ $\theta_i$'s are zero.


6.2. Asymptotic Expansions.

An asymptotic expansion of the joint pdf of $\ell_1,\ldots,\ell_p$ is obtained by substituting the asymptotic expansion of ${}_1F_1\{\tfrac12(n_1+n_2);\tfrac12 n_1;\tfrac12 n_2\Theta,L\}$ given by Theorem 5.2.1 and (5.3.5) into (6.1.1). The result is summarized in the following theorem.

Theorem 6.2.1.

An asymptotic expansion for large $n_2$ of the joint pdf of $\ell_1,\ldots,\ell_p$, when $\Omega=n_2\Theta$ and the $\theta_i$'s satisfy (6.1.3), is

(6.2.1)    $\varphi\{1+\dfrac{P_1}{n_2}+O(n_2^{-2})\}$

where


(6.2.2)  9 - C, 


k 1 


i-l 


+ t^V4) 1 


i"z^("  rp+i ) 


(p-n  )/4 

(“i0i)  <Vi+4) 
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k p i 

x ii  n 

i-l  j-k+l  1 3 

P £(n ,-p-i)  i(n-p-l)  p 

x 11  U 1 (i-t  ) 2 } A (4  -t  ) , 

i-k+l  k+i  J 

i<J 

^k(n^-p+^k+i)  ^k(2p-^k-3/2-3n2~ni ) 


rp-k(l(ni+n2"k))n2 


rp.k{i(ni-k))rp-k{l(p-k))rp(^2)  n{-P2+k-(p-k)2)/4 

x exp{-'ln  (k  + Z 6.))  *1  0.  , 

2 

i-l  1—1 


r - z 

1 i-i 


ii 

(p-ni ) (V.+2)  (P~ni  “ (/t‘iei+4)S) 


8 (-6  0 )2(t.d.+4)‘ 
l i l i ' 


2('£<.Qj  )s(-C».  9 +4  ) 
i i ii 


6 (^i0i)s  - (V^372 " 12  <W4) 

(6.  0 +4  )3//2 


r| 


i i 


k 

+ Z Z 
i<j 


11  11 

^t9!.)  (ijVU)  + (SV  (tiV4) 


k p 
+ Z Z 

i-l  j-k+l 


{VV(VV((-W4  + + ('“jV4)4) 

£ 

r + 


(v  d )' 

' i i’ 


0i(V<tj,t{Vi'  + {^i  V4)  } 


>1  is  a 

constant, 

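and $O(n_2^{-2})$ are terms which for fixed $\ell_i$'s and $\theta_i$'s are $O(n_2^{-2})$ as $n_2\to\infty$. Furthermore, for every $\varepsilon>0$ and $K>0$ let $B(K,\varepsilon)$ be the set

(6.2.3)    $B(K,\varepsilon)=\{(\theta_1,\ldots,\theta_k,\ell_1,\ldots,\ell_p):K>\theta_1,\ \theta_i-\theta_{i+1}>\varepsilon\ (1\le i\le k)$ and $\ell_j-\ell_{j+1}>\varepsilon\ (1\le j\le p)$ where $\theta_{k+1}=\ell_{p+1}=0\}.$

Then $f(L)\sim\varphi$ uniformly on $B(K,\varepsilon)$ for every $K>0$ and $\varepsilon>0$.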
The function $\varphi$ is called the "asymptotic pdf" or "asymptotic distribution" of $\ell_1,\ldots,\ell_p$.

The following corollaries follow easily from Theorem 6.2.1.
Corollary 6.2.1.

An asymptotic representation $\hat{\mathcal{L}}$ for large $n_2$ of the marginal likelihood function $\mathcal{L}$ defined by (6.1.2), when the $\theta_i$'s satisfy (6.1.3), is

£ - c}  n C{(-tiei)^  + (iiei+4)^}  2 ^ 1 ,6.{2k_p‘ni >/4(‘t19i+4)  *] 

1m  1 

x exp[^n2  2 C ^ 0±  )^(  ('t'idi  )^  + (‘t£ei+‘+)^)  - 9i)J 


L A 

x n (9  -0  )"“  , 

i<j  3 

where $C_3$ is a constant which does not depend on $\theta_1,\ldots,\theta_k$. Furthermore, $\mathcal{L}\sim\hat{\mathcal{L}}$ uniformly on $B(K,\varepsilon)$ defined by (6.2.3), for every $K>0$ and $\varepsilon>0$.

Corollary 6.2.2.

$\ell_1,\ldots,\ell_k$ are asymptotically sufficient for $\theta_1,\ldots,\theta_k$.

Corollary 6.2.3.

The asymptotic conditional pdf for large $n_2$ of $\ell_{k+1},\ldots,\ell_p$ given $\ell_1,\ldots,\ell_k$ is

(6.2.4)    $\varphi_c=C_4\prod_{i=1}^{k}\prod_{j=k+1}^{p}(\ell_i-\ell_j)^{1/2}\prod_{i=k+1}^{p}[\ell_i^{\frac12(n_1-p-1)}(1-\ell_i)^{\frac12(n_2-p-1)}]\prod_{k+1\le i<j\le p}(\ell_i-\ell_j).$

Furthermore if $f(\ell_{k+1},\ldots,\ell_p\mid\ell_1,\ldots,\ell_k)$ is the true conditional distribution of $\ell_{k+1},\ldots,\ell_p$ given $\ell_1,\ldots,\ell_k$ then $f(\ell_{k+1},\ldots,\ell_p\mid\ell_1,\ldots,\ell_k)\sim\varphi_c$ uniformly on $B(K,\varepsilon)$ for every $K>0$ and $\varepsilon>0$.

The asymptotic pdf $\varphi$, defined by (6.2.2), contains linkage factors of the form $(\ell_i-\ell_j)^{1/2}$. By making the transformation of variables suggested by Hsu (1941a) it is possible to obtain a "normal type" approximation which no longer contains linkage factors corresponding to distinct $\theta_i$'s.


Hsu let

(6.2.5)    $z_i=n_2^{1/2}\sigma_i^{-1}\{\ell_i(1-\ell_i)^{-1}-\theta_i\}\qquad(1\le i\le k)$

where

$\sigma_i=(2\theta_i)^{1/2}(\theta_i+2)^{1/2},$

and

(6.2.6)    $z_j=n_2\ell_j(1-\ell_j)^{-1}\qquad(k+1\le j\le p).$
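The transformation (6.2.5)-(6.2.6) is simple to compute. The sketch below is illustrative only: the function name `hsu_transform` and its argument layout are assumptions, and it takes $\sigma_i=(2\theta_i)^{1/2}(\theta_i+2)^{1/2}$ and $z_j=n_2\ell_j(1-\ell_j)^{-1}$ as read from the text.

```python
import math

def hsu_transform(ell, theta, n2):
    """Map sample roots ell_1 > ... > ell_p to z_1, ..., z_p.

    ell   : the p sample roots of B(B+W)^{-1}, each strictly between 0 and 1
    theta : the k nonzero population roots theta_1 > ... > theta_k > 0
    n2    : degrees of freedom of the "within groups" matrix W
    """
    k = len(theta)
    z = []
    for i, l in enumerate(ell):
        f = l / (1.0 - l)  # corresponding latent root of B W^{-1}
        if i < k:
            # Roots paired with nonzero theta_i: center and scale, as in (6.2.5).
            sigma = math.sqrt(2.0 * theta[i]) * math.sqrt(theta[i] + 2.0)
            z.append(math.sqrt(n2) * (f - theta[i]) / sigma)
        else:
            # Roots paired with theta_j = 0: rescale by n2, as in (6.2.6).
            z.append(n2 * f)
    return z
```

For example, with $k=1$, $\theta_1=1$ and $n_2=100$, a sample root $\ell_1=0.5$ gives $\ell_1(1-\ell_1)^{-1}=1=\theta_1$ and hence $z_1=0$.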


Making this change of variables in (6.2.2) and simplifying gives

Corollary 6.2.4.

The asymptotic joint pdf of $z_1,\ldots,z_p$, when the $\theta_i$'s satisfy (6.1.3), is

[ k | p £(n ,-P-i)  -iz  v 
(6. 2.7 ) s n g(z  ) > C 11  {z  e } n (z.-z.) 

fi-i  i i-k+i  1 k+i  J 

i<  J 


x < i + n 


1 


-* 


2 ♦ 0(n2-1)  V , 


i-i 


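where $g(z_i)$ denotes the standard normal density,

$C_5=\dfrac{\pi^{\frac12(p-k)^2}}{2^{\frac12(n_1-k)(p-k)}\,\Gamma_{p-k}\{\tfrac12(p-k)\}\,\Gamma_{p-k}\{\tfrac12(n_1-k)\}},$

$\alpha_i=(3\sigma_i)^{-1}\{4\theta_i+4-2(\theta_i+2)^{-1}\},$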


and 


k 
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at  -o1"*(n]  - 2^  - 3 - p + 2 ( J- 1 + (p-k)(2*9t) 


* V2*“i>  z ‘VV"'1  • 

Here $O(n_2^{-1})$ means terms which are $O(n_2^{-1})$ as $n_2\to\infty$ uniformly on any bounded set of $z_i$'s.

The  first  line  of  (6.2.7)  shows  that  asymptotically  the  z^'s  which 

correspond  to  distinct  nonzero  t^'s  are  standard  normal,  independent  of 

z.  (i^j)  . The  z.’s  corresponding  to  J.'s  equal  to  zero  are  dependent 
0 * i 

and  their  asymptotic  distribution  is  the  same  as  the  distribution  of  the 
latent  roots  of  a (p-k)  x (p-k)  matrix  having  the  Wishart  distribution 

Vk1vk’IP-k1  • 

Hsu  (194 la)  found  the  limiting  distribution  of  the  z/s  when  the 
8^’s  have  arbitrary  multiplicity.  His  result  reduces  to  the  first  line 
of  (6.2.7)  when  the  nonzero  9^’s  are  distinct.  Fujikoshi  ( 1 97 6 ) found 
the  limiting  distribution  of  the  z^’s,  up  to  and  including  terms  of  order 
n2  ^ , when  the  8^’s  have  arbitrary  multiplicity  and  are  nonzero.  If 
p = k then  (6.2.7)  reduces  to  Fujikoshi 's  result. 
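The normalizing constant of the Wishart latent-root density for the $p-k$ small roots is easiest to evaluate on the log scale. The following is a minimal sketch, assuming the standard definition $\Gamma_m(a) = \pi^{m(m-1)/4} \prod_{i=1}^{m} \Gamma\{a - \tfrac12(i-1)\}$ of the multivariate gamma function; the function names are chosen here for illustration only.

```python
import math


def log_multivariate_gamma(m, a):
    """log Gamma_m(a), with Gamma_m(a) = pi^{m(m-1)/4} prod_i Gamma(a - (i-1)/2)."""
    return (m * (m - 1) / 4.0) * math.log(math.pi) + sum(
        math.lgamma(a - 0.5 * i) for i in range(m)
    )


def log_wishart_root_constant(p, k, n1):
    """Log of the constant multiplying the latent-root density of W_{p-k}(n1-k, I).

    Requires n1 >= p so that every gamma argument is positive.
    """
    m = p - k   # dimension of the Wishart matrix
    n = n1 - k  # its degrees of freedom
    return (
        0.5 * m * m * math.log(math.pi)
        - 0.5 * n * m * math.log(2.0)
        - log_multivariate_gamma(m, 0.5 * m)
        - log_multivariate_gamma(m, 0.5 * n)
    )
```

When $p - k = 1$ the constant reduces to $1 / \{2^{n/2} \Gamma(n/2)\}$ with $n = n_1 - k$, the usual chi-square normalizing constant, which gives a convenient sanity check.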


6.3.  Tests  of  Significance  of  the  Residual  $\theta_i$.

In this section we examine Bartlett's test of the null hypothesis
$H_k$ that $\theta_{k+1} = \cdots = \theta_p = 0$ given that $\theta_1 > \cdots > \theta_k > 0$. The
approach followed is similar to the one used by James (1969).

Bartlett suggested that $H_k$ could be tested with the statistic

\[
\{ n_2 + \tfrac12 (n_1 - p - 1) \} \, T_k
\]

where

\[
T_k = - \ln \prod_{i=k+1}^{p} (1 - \ell_i) .
\]

It is well known that $n_2 T_k$ is asymptotically distributed as $\chi^2$ on
$(n_1-k)(p-k)$ degrees of freedom. The factor $n_2 + \tfrac12(n_1-p-1)$ was chosen
to improve the agreement between the moments of the test statistic and
the corresponding moments of $\chi^2_{(p-k)(n_1-k)}$.

The $\theta_i$'s are nuisance parameters in this test. Since $\ell_1, \ldots, \ell_k$ are
asymptotically sufficient for $\theta_1, \ldots, \theta_k$ (Corollary 6.2.2), the effect
of the $\theta_i$'s can be eliminated, at least asymptotically, by using the
asymptotic conditional distribution of $\ell_{k+1}, \ldots, \ell_p$ given $\ell_1, \ldots, \ell_k$.
The technique used here is to find the appropriate factor by computing the
mean and variance of $T_k$ with respect to the conditional density $\phi_c$
given by (6.2.4).

There are two points worth noting about $\phi_c$.

(i) If we make the following identifications

\[
\ell_i = r_i^2 , \quad n_1 = q , \quad p = p , \quad \text{and} \quad n_1 + n_2 = n
\]

in $\phi_c$ defined by (6.2.4), then we get the asymptotic conditional pdf
$\phi_c$ (4.2.4) of $r_{k+1}^2, \ldots, r_p^2$ given $r_1^2, \ldots, r_k^2$. It follows from this
fact that the same argument that was used in the canonical correlation
case may be used to compute the mean and variance of $T_k$ with respect
to $\phi_c$.
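The effect of the identifications in (i) can be written out explicitly. The display below is a sketch rather than a restatement of (4.2.4): it assumes that (6.2.4) consists of linkage factors raised to the power 1/2 together with beta-type factors, and the proportionality sign hides the normalizing constant.

```latex
\begin{align*}
\tfrac12 (n_1 - p - 1) &= \tfrac12 (q - p - 1), \\
\tfrac12 (n_2 - p - 1) &= \tfrac12 (n - q - p - 1), \\
\phi_c &\propto \prod_{i=1}^{k} \prod_{j=k+1}^{p} (r_i^2 - r_j^2)^{1/2}
  \prod_{i=k+1}^{p} \Bigl[ (r_i^2)^{\frac12 (q-p-1)} (1 - r_i^2)^{\frac12 (n-q-p-1)} \Bigr]
  \prod_{k+1 \le i < j \le p} (r_i^2 - r_j^2).
\end{align*}
```

The first two lines use only $n_1 = q$ and $n_2 = n - q$; the third substitutes $\ell_i = r_i^2$ throughout.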

(ii) Constantine and Muirhead (1976) found an asymptotic representation
for the joint pdf of $\ell_1, \ldots, \ell_p$ for large $\omega_1, \ldots, \omega_k$ when
$\Omega = \mathrm{diag}(\omega_1, \ldots, \omega_p)$ and
$\omega_1 > \cdots > \omega_k > \omega_{k+1} \ge \cdots \ge \omega_p \ge 0$.
While their asymptotic representation is markedly different from that
given in Theorem 6.2.1, the asymptotic conditional pdf of $\ell_{k+1}, \ldots, \ell_p$
given $\ell_1, \ldots, \ell_k$ is still $\phi_c$. In other words, $\phi_c$ serves as the
asymptotic conditional distribution both when $\omega_1, \ldots, \omega_k$ are large
independently of $n_2$ and when $\omega_1, \ldots, \omega_k$ are large in the sense that
$\omega_i = n_2 \theta_i$ and $n_2$ is large.

The following theorem is a direct consequence of observation (i)
and Theorem 4.3.1.

Theorem 6.3.1.

The statistic

\[
L_k = - \Bigl\{ n_2 - k + \tfrac12 (n_1 - p - 1) + \sum_{i=1}^{k} \ell_i^{-1} \Bigr\}
      \ln \prod_{i=k+1}^{p} (1 - \ell_i)
\]

is approximately distributed as $\chi^2$ on $(p-k)(n_1-k)$ degrees of freedom.

If the observed values of $\ell_1, \ldots, \ell_k$ are all near one, then the
multiplying factor in $L_k$ is approximately $-\{ n_2 + \tfrac12 (n_1 - p - 1) \}$, which is
the value suggested by Bartlett.
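The two multiplying factors are easy to compare numerically. The sketch below computes Bartlett's statistic and $L_k$ from a vector of sample roots; the function name and the returned dictionary keys are illustrative choices made here, not notation from the text.

```python
import math


def residual_root_statistics(ell, k, n1, n2):
    """Bartlett's statistic and L_k for testing that the last p - k roots vanish.

    ell : sample roots in (0, 1], largest first (length p); the last
          p - k entries must be strictly less than one
    k   : number of roots assumed nonzero
    n1  : hypothesis degrees of freedom
    n2  : error degrees of freedom
    """
    p = len(ell)
    t_k = -sum(math.log(1.0 - l) for l in ell[k:])  # T_k = -ln prod (1 - ell_i)
    bartlett_factor = n2 + 0.5 * (n1 - p - 1)
    adjusted_factor = n2 - k + 0.5 * (n1 - p - 1) + sum(1.0 / l for l in ell[:k])
    return {
        "T_k": t_k,
        "bartlett": bartlett_factor * t_k,
        "L_k": adjusted_factor * t_k,
        "df": (p - k) * (n1 - k),  # chi-square degrees of freedom
    }


res = residual_root_statistics([1.0, 1.0, 0.2, 0.1], k=2, n1=5, n2=50)
```

With the first k roots set exactly to one, the sum of reciprocals equals k and the two statistics coincide, which mirrors the remark following Theorem 6.3.1. For smaller leading roots the factor in $L_k$ is larger than Bartlett's.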


APPENDIX


The primary aim of this Appendix is to prove two results (Theorem 2
and Corollary 1) about the maximization of certain matrix functions.

The author's first approach to proving these results was to determine
the critical points of the functions by setting the appropriate
differentials equal to zero. The objection to this line of argument is that it
is complicated and provides little insight. An alternative approach is
based on Theorem 1. The advantage of this method of proof is that it is
fairly simple and easily generalized.

Theorem 1--

Suppose that

(i) $U = \mathrm{diag}(u_1, \ldots, u_k)$ with $u_1 \ge u_2 \ge \cdots \ge u_k \ge 0$;

(ii) $B(\alpha) = (b_{ij}(\alpha))$ is a $k \times k$ matrix defined for all
$\alpha \in A$; and

(iii) there exist constants $\delta_1 \ge \delta_2 \ge \cdots \ge \delta_k$ such that for
all $\alpha \in A$

\[
\begin{aligned}
b_{11}(\alpha) &\le \delta_1 \\
b_{11}(\alpha) + b_{22}(\alpha) &\le \delta_1 + \delta_2 \\
&\;\;\vdots \\
b_{11}(\alpha) + \cdots + b_{kk}(\alpha) &\le \delta_1 + \cdots + \delta_k .
\end{aligned}
\]

Then

(a) $\mathrm{tr}\{UB(\alpha)\} \le \sum_{i=1}^{k} u_i \delta_i$ for all $\alpha \in A$.

Furthermore, if

(iv) the $u_i$'s are distinct and nonzero, then

(b) $\mathrm{tr}\{UB(\alpha)\} = \sum_{i=1}^{k} u_i \delta_i$ if and only if

\[
\begin{aligned}
b_{11}(\alpha) &= \delta_1 \\
b_{11}(\alpha) + b_{22}(\alpha) &= \delta_1 + \delta_2 \\
&\;\;\vdots \\
b_{11}(\alpha) + \cdots + b_{kk}(\alpha) &= \delta_1 + \cdots + \delta_k .
\end{aligned}
\]

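Proof--

Let

\[
S_i(\alpha) = b_{11}(\alpha) + \cdots + b_{ii}(\alpha)
\]

and

\[
\Delta_i = \delta_1 + \cdots + \delta_i \qquad (1 \le i \le k) .
\]

Then

\[
\begin{aligned}
\mathrm{tr}\{UB(\alpha)\} &= \sum_{i=1}^{k} u_i b_{ii}(\alpha) \\
&= b_{11}(\alpha)(u_1 - u_2) + (b_{11}(\alpha) + b_{22}(\alpha))(u_2 - u_3) + \cdots \\
&\quad + (b_{11}(\alpha) + \cdots + b_{k-1,k-1}(\alpha))(u_{k-1} - u_k)
        + (b_{11}(\alpha) + \cdots + b_{kk}(\alpha)) u_k \\
&= \sum_{i=1}^{k-1} S_i(\alpha)(u_i - u_{i+1}) + S_k(\alpha) u_k .
\end{aligned}
\]

Similarly

\[
\sum_{i=1}^{k} u_i \delta_i = \sum_{i=1}^{k-1} \Delta_i (u_i - u_{i+1}) + \Delta_k u_k .
\]

Condition (i) says that $u_k \ge 0$ and $u_i - u_{i+1} \ge 0$ $(1 \le i \le k-1)$.
Condition (iii) says that $S_i(\alpha) \le \Delta_i$ for all $\alpha \in A$ and $i$ $(1 \le i \le k)$.
Therefore

\[
\sum_{i=1}^{k} u_i \delta_i - \mathrm{tr}\{UB(\alpha)\}
  = \sum_{i=1}^{k-1} \{\Delta_i - S_i(\alpha)\}(u_i - u_{i+1}) + \{\Delta_k - S_k(\alpha)\} u_k \ge 0 ,
\tag{1}
\]

proving (a). If the $u_i$'s are distinct and nonzero, then equality holds
in (1) if and only if $\Delta_i = S_i(\alpha)$ $(1 \le i \le k)$, proving (b).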
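The proof rests on a summation-by-parts rearrangement of the trace. The short check below is an illustrative sketch with made-up random inputs: it builds diagonal entries whose partial sums stay below the partial sums of the $\delta_i$, then confirms both the rearranged form of the trace and the bound in part (a).

```python
import random
from itertools import accumulate


def abel_sum(u, partial_sums):
    """sum_{i<k} S_i (u_i - u_{i+1}) + S_k u_k, the rearranged form of sum u_i b_ii."""
    k = len(u)
    head = sum(partial_sums[i] * (u[i] - u[i + 1]) for i in range(k - 1))
    return head + partial_sums[-1] * u[-1]


def check_theorem_1(k=5, trials=1000, seed=0):
    rng = random.Random(seed)
    for _ in range(trials):
        # u_1 >= ... >= u_k >= 0 and delta_1 >= ... >= delta_k
        u = sorted((rng.random() for _ in range(k)), reverse=True)
        delta = sorted((rng.uniform(-1.0, 1.0) for _ in range(k)), reverse=True)
        big_delta = list(accumulate(delta))
        # Partial sums S_i chosen so that S_i <= Delta_i, as in condition (iii).
        s = [d - rng.random() for d in big_delta]
        b = [s[0]] + [s[i] - s[i - 1] for i in range(1, k)]
        trace = sum(ui * bi for ui, bi in zip(u, b))
        bound = sum(ui * di for ui, di in zip(u, delta))
        assert abs(trace - abel_sum(u, s)) < 1e-9  # rearrangement identity
        assert trace <= bound + 1e-12              # part (a)
    return True
```

Only the diagonal of $B(\alpha)$ enters the trace, so the check works with the diagonal entries directly rather than with full matrices.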

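Lemma 1--

Assume that

(i) $H = \begin{bmatrix} H_{11} & H_{12} \\ H_{21} & H_{22} \end{bmatrix} \in O(m)$ where $H_{11}$ is $r \times s$;

(ii) $\alpha$ is an $s \times 1$ vector such that $\alpha'\alpha \le c$; and

(iii) $\beta = H_{11} \alpha$.

Then

\[
\beta'\beta \le c .
\]

Proof--

Let $\gamma$ and $\eta$ be $m \times 1$ vectors defined by $\gamma' = (\alpha', 0')$, where
$0$ is an $(m-s) \times 1$ vector of zeros, and $\eta = H\gamma$. Then

\[
\eta'\eta = (H\gamma)'(H\gamma) = \gamma' H' H \gamma = \gamma'\gamma = \alpha'\alpha \le c .
\]

Also $\eta = \begin{bmatrix} H_{11}\alpha \\ H_{21}\alpha \end{bmatrix}$ and therefore

\[
\begin{aligned}
\eta'\eta &= \alpha' H_{11}' H_{11} \alpha + \alpha' H_{21}' H_{21} \alpha \\
          &= \beta'\beta + (H_{21}\alpha)'(H_{21}\alpha) \\
          &\ge \beta'\beta .
\end{aligned}
\]

Combining these results

\[
\beta'\beta \le c .
\]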
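Lemma 1 says that a corner block of an orthogonal matrix cannot lengthen a vector. The sketch below checks this numerically. The helper `random_orthogonal` is an assumption made for illustration: it multiplies plane rotations together, so it produces only rotation matrices, which is enough for a spot check.

```python
import math
import random


def random_orthogonal(m, rng):
    """Product of random plane rotations: an m x m orthogonal matrix (a rotation)."""
    h = [[1.0 if i == j else 0.0 for j in range(m)] for i in range(m)]
    for i in range(m):
        for j in range(i + 1, m):
            t = rng.uniform(0.0, 2.0 * math.pi)
            c, s = math.cos(t), math.sin(t)
            for row in h:  # right-multiply by a rotation in the (i, j) plane
                a, b = row[i], row[j]
                row[i], row[j] = c * a - s * b, s * a + c * b
    return h


def lemma_1_holds(m=6, r=3, s=4, trials=500, seed=1):
    rng = random.Random(seed)
    for _ in range(trials):
        h = random_orthogonal(m, rng)
        alpha = [rng.uniform(-1.0, 1.0) for _ in range(s)]
        # beta = H_11 alpha, with H_11 the leading r x s block of H
        beta = [sum(h[i][j] * alpha[j] for j in range(s)) for i in range(r)]
        if sum(b * b for b in beta) > sum(a * a for a in alpha) + 1e-12:
            return False
    return True
```

Taking $c = \alpha'\alpha$ in the lemma gives exactly the comparison made inside the loop.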


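Theorem 2--

Suppose that

(i) $U = \mathrm{diag}(u_1, \ldots, u_k)$ with $u_1 \ge u_2 \ge \cdots \ge u_k \ge 0$;

(ii) $V = \mathrm{diag}(v_1, \ldots, v_k)$ with $v_1 \ge v_2 \ge \cdots \ge v_k \ge 0$;

(iii) $P_1 = \mathrm{diag}(\rho_1, \ldots, \rho_k)$ with $\rho_1 \ge \rho_2 \ge \cdots \ge \rho_k \ge 0$;

(iv) $R = \mathrm{diag}(r_1, \ldots, r_p)$ with $r_1 \ge r_2 \ge \cdots \ge r_p \ge 0$, $(p \ge k)$;

(v) $F = (f_{ij}) \in O(k)$ and $G = (g_{ij}) \in O(k)$ $(1 \le i, j \le k)$;

(vi) $H_1 = (h_{ij}) \in V(k,p)$ $(1 \le i \le p,\ 1 \le j \le k)$;

(vii) $E_1 = (e_{ij}) \in V(k,q)$ $(1 \le i \le q,\ 1 \le j \le k)$, $(q \ge p)$; and

(viii) $T = \mathrm{tr}\{ U F V G P_1 H_1' [R : 0] E_1 \}$.

Then for fixed $U$, $V$, $P_1$, and $R$

(a) $\displaystyle \sup_{F, G, H_1, E_1} T = \sum_{i=1}^{k} u_i v_i \rho_i r_i$.

Furthermore, if

(ix) the $u_i$'s, $v_i$'s, $\rho_i$'s, and $r_i$'s are distinct and nonzero,
then

(b) $\displaystyle T = \sum_{i=1}^{k} u_i v_i \rho_i r_i$ if and only if

\[
G = \mathrm{diag}(\pm 1, \ldots, \pm 1), \quad F = \mathrm{diag}(\pm 1, \ldots, \pm 1), \quad
H_1 = \begin{bmatrix} \mathrm{diag}(\pm 1, \ldots, \pm 1) \\ 0 \end{bmatrix}, \quad \text{and} \quad
E_1 = \begin{bmatrix} \mathrm{diag}(\pm 1, \ldots, \pm 1) \\ 0 \end{bmatrix},
\]

where $g_{ii} f_{ii} h_{ii} e_{ii} = 1$ $(1 \le i \le k)$.

Proof--

The theorem follows from a repeated application of Theorem 1. Using
the notation of that theorem we have

\[
\begin{aligned}
A &= \{ (G, F, H_1, E_1) : G, F \in O(k),\ H_1 \in V(k,p),\ \text{and } E_1 \in V(k,q) \}, \\
\alpha &= (G, F, H_1, E_1), \text{ and} \\
B(\alpha) &= F V G P_1 H_1' [R : 0] E_1 .
\end{aligned}
\]

Let

\[
\alpha_0 = \Bigl( I_k,\ I_k,\ \begin{bmatrix} I_k \\ 0 \end{bmatrix},\ \begin{bmatrix} I_k \\ 0 \end{bmatrix} \Bigr),
\]

where $I_k$ is the $k \times k$ identity matrix. Then

\[
\sup T \ge \mathrm{tr}\{ U B(\alpha_0) \} = \sum_{i=1}^{k} u_i v_i \rho_i r_i .
\]

Therefore to prove (a) it remains to prove that

\[
T \le \sum_{i=1}^{k} u_i v_i \rho_i r_i
\tag{2}
\]

for all $\alpha \in A$. If

\[
\sum_{i=1}^{\ell} b_{ii}(\alpha) \le \sum_{i=1}^{\ell} v_i \rho_i r_i
\tag{3}
\]

for all $\alpha \in A$ and $\ell$ $(1 \le \ell \le k)$, then (2) follows from Theorem 1.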

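In the subsequent argument it will be necessary to consider various
submatrices of $G$, $F$, $H_1$, and $E_1$. For any $r \times s$ matrix $X$ let $X^{mn}$
denote the $m \times n$ submatrix $(m \le r,\ n \le s)$ formed by the elements in the
first $m$ rows and $n$ columns of $X$, i.e., if $X = (x_{ij})$ $(1 \le i \le r,\ 1 \le j \le s)$
then $X^{mn} = (x_{ij})$ $(1 \le i \le m,\ 1 \le j \le n)$.

Using this notation we have

\[
\begin{aligned}
b_{11}(\alpha) + \cdots + b_{\ell\ell}(\alpha)
  &= \mathrm{tr}\{ F^{\ell k} V G P_1 H_1' [R : 0] E_1^{q\ell} \} \\
  &= \mathrm{tr}\{ V G P_1 H_1' [R : 0] E_1^{q\ell} F^{\ell k} \} \\
  &= \mathrm{tr}\{ V G P_1 H_1' R E_1^{p\ell} F^{\ell k} \} \\
  &= \mathrm{tr}\{ V C(\ell) \} \\
  &= \sum_{i=1}^{k} v_i c_{ii}(\ell)
\end{aligned}
\]

where $C(\ell) = (c_{ij}(\ell))$ is the $k \times k$ matrix

\[
C(\ell) = G P_1 H_1' R E_1^{p\ell} F^{\ell k} .
\]

In fact $C(\ell)$ also depends on $\alpha$ but we have suppressed the $\alpha$ to
simplify the notation.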


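If

\[
\sum_{i=1}^{m} c_{ii}(\ell) \le \sum_{i=1}^{\min(m,\ell)} \rho_i r_i
\tag{4}
\]

for all $\alpha \in A$ and $m$ $(1 \le m \le k)$, then (3) follows from (4) by Theorem 1.
We have

\[
\begin{aligned}
c_{11}(\ell) + \cdots + c_{mm}(\ell)
  &= \mathrm{tr}\{ G^{mk} P_1 H_1' R E_1^{p\ell} F^{\ell m} \} \\
  &= \mathrm{tr}\{ P_1 H_1' R E_1^{p\ell} F^{\ell m} G^{mk} \} \\
  &= \mathrm{tr}\{ P_1 D(\ell, m) \} \\
  &= \sum_{i=1}^{k} \rho_i d_{ii}(\ell, m)
\end{aligned}
\]

where $D(\ell, m) = (d_{ij}(\ell, m))$ is the $k \times k$ matrix

\[
D(\ell, m) = H_1' R E_1^{p\ell} F^{\ell m} G^{mk} .
\]

If

\[
\sum_{i=1}^{n} d_{ii}(\ell, m) \le \sum_{i=1}^{\min(n,m,\ell)} r_i
\tag{5}
\]

for all $\alpha \in A$ and $n$ $(1 \le n \le k)$, then (4) follows from (5) by Theorem 1.
We have

\[
\begin{aligned}
d_{11}(\ell, m) + \cdots + d_{nn}(\ell, m)
  &= \mathrm{tr}\{ (H_1^{pn})' R E_1^{p\ell} F^{\ell m} G^{mn} \} \\
  &= \mathrm{tr}\{ R S(\ell, m, n) \} \\
  &= \sum_{i=1}^{p} r_i s_{ii}(\ell, m, n)
\end{aligned}
\]

where $S(\ell, m, n) = (s_{ij}(\ell, m, n))$ is the $p \times p$ matrix

\[
S(\ell, m, n) = E_1^{p\ell} F^{\ell m} G^{mn} (H_1^{pn})' .
\]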

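If

\[
\sum_{i=1}^{q} s_{ii}(\ell, m, n) \le \min(\ell, m, n, q)
\tag{6}
\]

for all $\alpha \in A$ and $q$ $(1 \le q \le p)$, then (5) follows from (6) by Theorem 1.
Condition (6) is equivalent to the following two conditions. For
all $\alpha \in A$

\[
\mathrm{tr}\{ S(\ell, m, n) \} \le \min(\ell, m, n)
\tag{7}
\]

and

\[
s_{ii}(\ell, m, n) \le 1 \qquad (1 \le i \le p) .
\tag{8}
\]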

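First consider (7). There are three possibilities. Namely,
$\min(\ell, m, n) = \ell$, $\min(\ell, m, n) = m$, and $\min(\ell, m, n) = n$. Because the
proof is essentially the same in all three cases, we assume that
$\min(\ell, m, n) = \ell$. Then

\[
\mathrm{tr}\{ S(\ell, m, n) \} = \mathrm{tr}\{ F^{\ell m} G^{mn} (H_1^{pn})' E_1^{p\ell} \}
  = \sum_{i=1}^{\ell} f_i' G^{mn} (H_1^{pn})' e_i
\]

where $f_i'$ is the $i$-th row of $F^{\ell m}$ and $e_i$ is the $i$-th column of $E_1^{p\ell}$.
Since $F^{\ell m}$ and $E_1^{p\ell}$ are submatrices of orthogonal matrices, $f_i' f_i \le 1$
and $e_i' e_i \le 1$. Let $\alpha_i' = f_i' G^{mn}$ and $\beta_i = (H_1^{pn})' e_i$. By Lemma 1
$\alpha_i' \alpha_i \le 1$ and $\beta_i' \beta_i \le 1$. Hence, by the Cauchy-Schwarz inequality,

\[
\mathrm{tr}\{ S(\ell, m, n) \} = \sum_{i=1}^{\ell} \alpha_i' \beta_i
  \le \sum_{i=1}^{\ell} (\alpha_i' \alpha_i)^{1/2} (\beta_i' \beta_i)^{1/2}
  \le \ell = \min(\ell, m, n) ,
\]

proving (7).

To prove (8) note that

\[
s_{ii}(\ell, m, n) = e_i' F^{\ell m} G^{mn} h_i
\]

where now $e_i'$ is the $i$-th row of $E_1^{p\ell}$ and $h_i$ is the $i$-th column of
$(H_1^{pn})'$. Since $E_1^{p\ell}$ and $H_1^{pn}$ are submatrices of orthogonal matrices,
$e_i' e_i \le 1$ and $h_i' h_i \le 1$. Let $\alpha_i' = e_i' F^{\ell m}$ and $\beta_i = G^{mn} h_i$.
By Lemma 1 $\alpha_i' \alpha_i \le 1$ and $\beta_i' \beta_i \le 1$. Hence,

\[
s_{ii}(\ell, m, n) = \alpha_i' \beta_i \le (\alpha_i' \alpha_i)^{1/2} (\beta_i' \beta_i)^{1/2} \le 1 ,
\]

proving (8).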

Let '$a \to b$' mean '$a$ implies $b$'. The preceding discussion can be
summarized as follows. We have proved that (7) and (8) hold for all
$\alpha \in A$ and that (7) and (8) $\to$ (6) for all $q$ $(1 \le q \le p)$ $\to$ (5) for all
$n$ $(1 \le n \le k)$ $\to$ (4) for all $m$ $(1 \le m \le k)$ $\to$ (3) for all $\ell$ $(1 \le \ell \le k)$ $\to$ (2).
Therefore

\[
T \le \sum_{i=1}^{k} u_i v_i \rho_i r_i
\]

for all $\alpha \in A$, proving (a).

Now assume that the $u_i$'s, $v_i$'s, $\rho_i$'s, and $r_i$'s are distinct and
nonzero. We could prove (b) by a more careful analysis of the argument used
to prove (a). In fact it is easier to prove (b) directly. Recall that

\[
B(\alpha) = F V G P_1 H_1' [R : 0] E_1 .
\]

If

\[
F = \mathrm{diag}(\pm 1, \ldots, \pm 1), \quad G = \mathrm{diag}(\pm 1, \ldots, \pm 1), \quad
H_1 = \begin{bmatrix} \mathrm{diag}(\pm 1, \ldots, \pm 1) \\ 0 \end{bmatrix}, \quad \text{and} \quad
E_1 = \begin{bmatrix} \mathrm{diag}(\pm 1, \ldots, \pm 1) \\ 0 \end{bmatrix},
\tag{9}
\]

where $g_{ii} f_{ii} h_{ii} e_{ii} = 1$ $(1 \le i \le k)$, then

\[
B(\alpha) = \mathrm{diag}(v_1 r_1 \rho_1, \ldots, v_k r_k \rho_k)
\]

and

\[
T = \sum_{i=1}^{k} u_i v_i r_i \rho_i .
\]


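Conversely, suppose that

\[
T = \sum_{i=1}^{k} u_i v_i r_i \rho_i .
\]

It follows from Theorem 1 and (a) that

\[
\begin{aligned}
b_{11}(\alpha) &= v_1 r_1 \rho_1 \\
b_{11}(\alpha) + b_{22}(\alpha) &= v_1 r_1 \rho_1 + v_2 r_2 \rho_2 \\
&\;\;\vdots \\
b_{11}(\alpha) + \cdots + b_{kk}(\alpha) &= v_1 r_1 \rho_1 + \cdots + v_k r_k \rho_k .
\end{aligned}
\tag{10}
\]

Let $\xi' = (f_{11} v_1, \ldots, f_{1k} v_k)$ and $\eta' = (e_{11} r_1, \ldots, e_{p1} r_p)$. Then from the
Cauchy-Schwarz inequality

\[
b_{11}(\alpha) = \xi' G P_1 H_1' \eta \le (\xi'\xi)^{1/2} (\eta' H_1 P_1^2 H_1' \eta)^{1/2}
\tag{11}
\]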


with  equality 
F € 0{k) 
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(14) 


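\[
\lambda' P_1^2 \lambda \le \rho_1^2 \lambda'\lambda
\]

with equality if and only if $\lambda_i = 0$ $(2 \le i \le k)$. From Lemma 1

\[
\lambda'\lambda \le \eta'\eta .
\tag{15}
\]

Combining (11)-(15) we have

\[
b_{11}(\alpha) \le v_1 r_1 \rho_1
\]

with equality if and only if

\[
G = \mathrm{diag}(\pm 1, G_2), \quad F = \mathrm{diag}(\pm 1, F_2), \quad
H_1 = \begin{bmatrix} \pm 1 & 0 \\ 0 & H_2 \end{bmatrix}, \quad
E_1 = \begin{bmatrix} \pm 1 & 0 \\ 0 & E_2 \end{bmatrix},
\]

where $g_{11} f_{11} h_{11} e_{11} = 1$, $G_2, F_2 \in O(k-1)$, $H_2 \in V(k-1, p-1)$, and
$E_2 \in V(k-1, q-1)$. It follows from (10) that $b_{22}(\alpha) = v_2 r_2 \rho_2$. We can
now use the preceding argument with $G_2$, $F_2$, $H_2$, and $E_2$ to show that
$f_{22} = \pm 1$, $g_{22} = \pm 1$, $h_{22} = \pm 1$, and $e_{22} = \pm 1$ where
$g_{22} f_{22} h_{22} e_{22} = 1$.
Continuing in this manner we finally have

\[
T = \sum_{i=1}^{k} u_i v_i r_i \rho_i
\]

implies $F$, $G$, $H_1$, and $E_1$ satisfy (9), proving (b).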
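Part (a) of Theorem 2 can be spot-checked by evaluating $T$ at randomly drawn orthogonal factors and comparing with the bound. This is a sketch under stated assumptions: the numerical values of $u$, $v$, $\rho$, $r$ are arbitrary, all helper names are invented for illustration, and the random matrices are products of plane rotations, so only part of $O(k)$ is sampled.

```python
import math
import random


def random_orthogonal(m, rng):
    """Product of random plane rotations: an m x m orthogonal matrix (a rotation)."""
    h = [[1.0 if i == j else 0.0 for j in range(m)] for i in range(m)]
    for i in range(m):
        for j in range(i + 1, m):
            t = rng.uniform(0.0, 2.0 * math.pi)
            c, s = math.cos(t), math.sin(t)
            for row in h:
                a, b = row[i], row[j]
                row[i], row[j] = c * a - s * b, s * a + c * b
    return h


def stiefel(k, n, rng):
    """An n x k matrix with orthonormal columns (first k columns of a rotation)."""
    return [row[:k] for row in random_orthogonal(n, rng)]


def eye_rect(n, k):
    """The n x k matrix [I_k ; 0]."""
    return [[1.0 if i == j else 0.0 for j in range(k)] for i in range(n)]


def diag(d):
    return [[d[i] if i == j else 0.0 for j in range(len(d))] for i in range(len(d))]


def matmul(a, b):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def trace_t(u, v, rho, r, f, g, h1, e1):
    """T = tr{U F V G P_1 H_1' [R:0] E_1}; h1 is p x k and e1 is q x k with q >= p."""
    k, p = len(u), len(r)
    # M = H_1' [R:0] E_1 is k x k; only the first p rows of E_1 contribute.
    m = [[sum(h1[i][a] * r[i] * e1[i][b] for i in range(p)) for b in range(k)]
         for a in range(k)]
    prod = diag(u)
    for factor in (f, diag(v), g, diag(rho), m):
        prod = matmul(prod, factor)
    return sum(prod[i][i] for i in range(k))


def check_theorem_2(trials=300, seed=2):
    rng = random.Random(seed)
    k, p, q = 3, 4, 5
    u, v, rho = [3.0, 2.0, 1.0], [2.5, 1.5, 0.5], [0.9, 0.6, 0.3]
    r = [4.0, 3.0, 2.0, 1.0]
    bound = sum(u[i] * v[i] * rho[i] * r[i] for i in range(k))
    largest = max(
        trace_t(u, v, rho, r, random_orthogonal(k, rng), random_orthogonal(k, rng),
                stiefel(k, p, rng), stiefel(k, q, rng))
        for _ in range(trials)
    )
    at_identity = trace_t(u, v, rho, r, eye_rect(k, k), eye_rect(k, k),
                          eye_rect(p, k), eye_rect(q, k))
    return bound, largest, at_identity
```

The value at the identity configuration equals the bound, as in the choice $\alpha_0$ used in the proof, while the random draws should never exceed it. Fixing $F = I_k$ in the same routine gives a check of the constrained problem treated in Corollary 1.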
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1 




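Corollary 1--

Suppose that

(i) $V = \mathrm{diag}(v_1, \ldots, v_k)$ with $v_1 \ge v_2 \ge \cdots \ge v_k \ge 0$;

(ii) $\Theta = \mathrm{diag}(\theta_1, \ldots, \theta_k)$ with $\theta_1 \ge \theta_2 \ge \cdots \ge \theta_k \ge 0$;

(iii) $L = \mathrm{diag}(\ell_1, \ldots, \ell_p)$ with $\ell_1 \ge \ell_2 \ge \cdots \ge \ell_p \ge 0$;

(iv) $G \in O(k)$;

(v) $H_1 \in V(k,p)$;

(vi) $E_1 \in V(k,n_1)$, $(n_1 \ge p)$; and

(vii) $T = \mathrm{tr}\{ V G \Theta^{1/2} H_1' [L^{1/2} : 0] E_1 \}$.

Then for fixed $V$, $\Theta$, and $L$

(a) $\displaystyle \sup_{G, H_1, E_1} T = \sum_{i=1}^{k} v_i \theta_i^{1/2} \ell_i^{1/2}$.

Furthermore, if

(viii) the $v_i$'s, $\theta_i$'s, and $\ell_i$'s are distinct and nonzero, then

(b) $\displaystyle T = \sum_{i=1}^{k} v_i \theta_i^{1/2} \ell_i^{1/2}$

if and only if $G = \mathrm{diag}(\pm 1, \ldots, \pm 1)$,

\[
H_1 = \begin{bmatrix} \mathrm{diag}(\pm 1, \ldots, \pm 1) \\ 0 \end{bmatrix} \quad \text{and} \quad
E_1 = \begin{bmatrix} \mathrm{diag}(\pm 1, \ldots, \pm 1) \\ 0 \end{bmatrix},
\]

where $g_{ii} h_{ii} e_{ii} = 1$ $(1 \le i \le k)$.

Proof--

$T$ may be expressed as

\[
T = \mathrm{tr}\{ F V G \Theta^{1/2} H_1' [L^{1/2} : 0] E_1 \}
\]

where $F = I_k$, the $k \times k$ identity matrix. Corollary 1 now follows
directly from Theorem 2 by adding the constraint $F = I_k$.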
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